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PREFACE 


Work on this book was started when both of us were members of the 
staff of Iowa State College. At that time requests were received rather 
frequently from research workers. Some wanted advice on the conduct 
of a specific experiment: others, who had decided to use one of the more 
complex designs that have been discovered in recent years, asked for a 
plan or layout that could be followed during the experimental opera- 
tions. Although the logical principles governing the subject of experi- 
mentation are admirably expounded in Fisher’s book The Design of 
Experiments, these requests indicated a need for a different type of 
book, one which would describe in some detail the most useful of the 
designs that have been developed, with accompanying plans and an 
account of the experimental situations for which each design is most 
suitable. Such a book is directed at the experimenter and is intended 
to serve as a handbook which is consulted when a new experiment is 
under consideration. 

Mainly on account of the war, slow progress was made. In 1944 
we completed a mimeographed draft of which several hundred copies 
were distributed. Many helpful suggestions were made by readers. 
Of these, the most frequent was to the effect that we should include more 
material dealing with the statistical analysis of the results. In the 
mimeographed draft, our practice was to give references to worked 
examples of the analysis in cases where they could be found in the 
literature, and to present examples for only those designs whose analysis 
was not available. To this it was objected that many research workers 
did not have easy access to our references. 

This suggestion raised a difficult issue. To present a self-contained 
account of the analysis of variance in all its ramifications would make 
the book, it seemed to us, unwarrantably long and expensive. Conse- 
quently we have continued to assume that the reader has some knowl- 
edge of the principles of analysis of variance and of the computational 
methods involved. We have included a brief review of the basic theory 
and an extensive set of worked examples of the analysis for both com- 
mon and less common types of design. Although strenuous efforts 
were made to obtain a selection of examples from diverse fields of 
research, a preponderance from biology, and more particularly from 
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agriculture, was dictated by our own experience in those areas. On 
several occasions we rejected an example which would have made an 
attractive addition to the scope because we did not feel sufficiently 
familiar with the conditions of experimentation to give a realistic account 
of the problems encountered. 

Since courses of lectures on the design of experiments are being intro- 
duced in many colleges and universities, some teachers may be inter- 
ested in the potentialities of the book as a textbook. Several comments 
are prompted by our own experience in this connection. First, it will 
often be necessary for the teacher to provide a more systematic develop- 
ment of the analysis of variance than is given here. Second, interest in 
such a course is greatly enhanced by examples from an environment 
with which the listeners are familiar, and especially by examples of 
experiments that have been conducted by some of the listeners. Thus 
the teacher is well advised to build the course around his own examples, 
using those in this book mainly as supplementary material for the 
students. Finally, a selective use of the book is in order, because it 
contains much more material than can be covered in a typical one- 
quarter course, and because some difficult topics have been dealt with 
in the early parts of the book. 

We wish to express our gratitude to the many staff members at Iowa 
State College and North Carolina State College who helped us by putting 
experimental data at our disposal, by providing painstaking descrip- 
tions of their experimental techniques, or by lending their assistance 
in the preparation of the manuscript and plans. For similar kindnesses 
in connection with certain of the examples we are indebted to 
G. F. Potter, D. Y. Solandt, D. B. DeLury and J. Hunter, F. M. Wad- 
ley and C. F. Rainwater, and the Wailuku Sugar Company, Honolulu. 
Some theoretical results used in Chapter 14 were developed from research 
conducted at Raleigh under a contract with the Office of Naval Research. 
Finally, our thanks go to George W. Snedecor, who participated in the 
original plans for this book and made a careful reading of the first draft, 


W. G. Cocuran 
G. M. Cox 
January, 1950 
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CHAPTER 1 


INTRODUCTION 


1.1 The Contribution of Statistics to Experimentation 


1.11 The Problem of Interpretation. Since statisticians do not usually 
perform experiments, their claim to attention when they write on this 
subject requires some explanation. It is true that on many important 
aspects of experimentation the statistician has no expert knowledge. 
Nevertheless, in recent years, research workers have turned increasingly 
to statisticians for help both in planning their experiments and in draw- 
ing conclusions from the results. That this has happened is convincing 
evidence that statistics has something to contribute. 

At first, requests for assistance were nearly always concerned with 
the interpretation of the results. It is a common characteristic of ex- 
periments in widely diverse fields of research that, when they are re- 
peated, the effects of the experimental treatments vary from trial to 
trial. This variation introduces a degree of uncertainty into any con- 
clusions that are drawn from the results. Even after a number of repeti- 
tions, or replications as they are called, the investigator still does not 
know by how much his results would be changed if the experiment were 
repeated further under the same conditions. Successive trials may be so 
discrepant in their results that it is doubtful which of two treatments 
would turn out better in the long run. 

As an illustration of this variation, data are given in table 1.1 from a 
simple experiment. The data are the times (minus 2 minutes) required 
to perform a routine statistical calculation, that of finding the sum of 
squares of 27 observations. Each sum of squares was computed sep- 
arately by the same person on both of two standard computing machines. 
In all, 10 different sums of squares were worked, making 10 trials or rep- 
lications of the experiment. It will be noted that the differences in 
speed range from 17 seconds in favor of machine B to 2 seconds in favor 
of machine A. Some experimenters may comment that the results of 
this experiment are remarkably well behaved, and exhibit nothing like 
the variation with which they have to contend. The results will, how- 
ever, serve our purpose. 
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TABLE 1.1 Tre ry seconns (MINUS 2 MINUTES) REQUIRED FOR COMPUTING 
SUMS OF SQUARES 


Machine 
—_——— Difference 
Replication A B (A — B) 
1 30 14 16 
2 21 21 0 
3 22 5 a 
4 22 13 9 
5 18 13 5 
6 * 29 17 12 
7 16 7 9 
Shs 12 14 -2 
9 23 8 15 
10 23 24 -1 
Means 21.6 13.6 8.0 


The object of the experiment is, of course, to compare the speeds of 
the two machines for this calculation. More specifically, two objects 
might be stated. The first is to answer the question: is there any dif- 
ference in speed? or, to put it another way, to test the hypothesis that 
there is no difference in speed. The second object, which is related to the 
first, is to estimate the size of the difference in speed. Almost all ex- 
periments are carried out for one or both of these purposes—the testing 
of hypotheses and the estimation of differences in the effects of different 
treatments. 

As regards the test of the hypothesis that there is no difference in 
speed, we might report, as’ relevant evidence, that B proved faster 7 
times out of 10, A twice, while once there was a tie. For the problem of 
estimation we might report that the average difference in speed in the 
experiment was 8 seconds in favor of B. These purely descriptive state- 
ments do not carry us very far. Their weakness is that they supply no 
information about the reliability of the figures presented. For example, 
have we any confidence that, if the experiment were continued for an- 
other set, of 10 trials, the advantage at the end would still be close to 8 
seconds in favor of B? 

Because of the deficiencies in the descriptive approach, experimenters 
have adopted a different point of view in the summarization of their re- 
sults. They tend to reason as follows. Suppose that it were feasible to 
continue the experiment indefinitely under the same conditions. The 
average difference in speed between the two machines would presumably 
settle down to some fixed value. This.value, which will be independent 
of the size of the experiment that was actually carried out, may reason- 
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ably be called the true difference between A and B. From this point of 
view, the problem of summarizing the results may be restated in the 
question: what can we say about the true difference between A and B? 
This is a problem in induction from the part to the whole, or in sta- 
tistical language, from the sample to the population. A solution to this 
problem has been developed by means of the theory of statistics. It is 
this solution that constitutes the principal contribution of statistics to 
the interpretation of the results. 


1.12 Statistical Inference. Obviously, it cannot be expected that the 
solution will provide the exact value of the unknown true difference. As 
a less ambitious goal we might hope to be able to find 2 limits within 
which the exact value is certain to lie, but even this cannot quite be at- 
tained. What can be done is that for any chosen probability, say .95, 
two limits are found such that the probability that they enclose the true 
difference is .95. In other words, limits can be found that are almost 
certain to enclose the true difference, where the degree of certainty, as 
measured by the probability, can be chosen by the experimenter. Since 
we wish to focus attention on the type of inferential statement that can 
be made rather than on the method of calculating the limits, the com- 
putations will not be discussed at present. For the example in table 1.1 
they will be found in section 4.42. When the probability is .95, the limits 
for the true difference in speed between the 2 machines turn out to be 
3.3 and 12.7 seconds in favor of machine B. A statement that B is 
faster by an amount that lies between 3.3 and 12.7 seconds has a 1 in 
20 chance of being wrong. If the degree of certainty is decreased by 
lowering the probability to .8, the limits are narrowed to 5.1 and 10.9 
seconds. If the probability is raised to .99, the limits become 1.1 and 
14,9 seconds, and as the probability is brought closer to certainty, the 
limits steadily become farther apart. The limits are cailed confidence 
limits, and the probabilities are called confidence probabilities. * 

These probabilities are not merely academic abstractions: they can 
be subjected to at least a rough experimental verification, For verifi- 
cation, we need a situation where the true difference between the effects 
of two treatments is known. In toxicology, for instance, this situation 
can sometimes be obtained by diluting a standard poison to a known 
extent. The dilution is sent to the laboratory, labelled as an “unknown” 
poison. By experiments on animals, confidence limits are found for the 


* Fisher (1.2) has developed statistical inference in terms of fiducial limits. The 
two concepts, fiducial and confidence limits, have different logical backgrounds, al- 
though in all simple applications to controlled experiments the actual values of the 
fiducial and confidence limits are identical. For a discussion, see Kendall (1.1). 
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amount of the unknown that has the same toxicity as a given amount 
of the standard. Since the persons originating the experiment know the 
true value for this amount, they can verify whether the statement that 
the true amount lies within the limits is correct. The practical difficulty 
with this type of check is the labor required. A large number of experi- 
ments would be needed to verify whether about 95% of the statements 
made with confidence probability .95 were in fact correct, and about 
5% were wrong. 

As we have seen, the statistical solution to the problem of estima- 
tion consists of a statement that the true difference lies between certain 
limits, plus a probability that the statement is correct, It is of interest 
to consider whether this type of information is sufficiently precise to 
permit decisions of practical importance to be made. Although a 
thorough discussion of this question would be rather lengthy, inferences 
of this type often permit definite action to be taken with confidence 
that the action will be fruitful. When they fail to decide the point at 
issue, the reason is nearly always that the data obtained are insufficient. 
For illustration, suppose that it is desired to discover whether the ap- 
plication of a dressing of some fertilizer to a crop will be profitable. The 
cost of the fertilizer is such that its application will be profitable only if 
it increases yields on’the average by 2 or more bushels. A series of ex- 
periments is carried out in order to estimate the true average response 
to the fertilizer. If the 95% confidence limits for the increase due to the 
fertilizer are 4 and 11 bushels, its use can be adopted with a good deal 
of assurance that it will be profitable. Similarly, if the 95% confidence 
limits are —5 and 1 bushels, a decision not to use the fertilizer follows. 
A case where there is uncertainty occurs when the limits are 0 and 5 
bushels. Here it is likely that there will be either a small gain or a small 
loss, but no recommendation can be made without considerable risk of 
its being wrong. If it is important to make the correct decision, further 
experiments must be conducted in order to narrow the distance between 
the confidence limits. 

Thus far we have considered the problem of estimating the true dif- 
ference between the effects of two treatments. In testing hypotheses, 
we are interested in the supposition that the true difference has some 
specified value, most commonly zero. As in the case of estimation, diffi- 
culty arises because of the variability that is typical of experimental 
data. As a result of this variability, the data are never exactly in agree- 
ment with the hypothesis, and the problem is to decide whether the 
discrepancy between the data and the hypothesis is to be ascribed to 
these variations or to the fact that the hypothesis is not true. The con- 
tribution of statistics is the operation known as a test of significance. 
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Essentially, this is a rule for deciding, from examination of the data, 
whether or not to reject the hypothesis. Such rules are made to satisfy 
two conditions that are obviously desirable:_(i) hypotheses that are true 
shall be rejected only very occasionally, and the probability of rejection 
may be chosen by the experimenter; (ii) hypotheses that are false shall 
be rejected as often as possible. 

This technique enables the experimenter to test his hypotheses about 
the action of the treatments, with the assurance that there is little risk 
of erroneously rejecting a hypothesis that happens to be correct. Prob- 
abilities of .05 and .01 are most commonly used for this risk, and in 
these cases the tests are said to be made at the 5 and 1% significance 
levels respectively. These levels are just useful conventions, and a lower 
probability may be used if the consequences of an erroneous rejection 
of the hypothesis are very serious. It should be remembered, however, 
that in lowering this probability value we automatically diminish the 
chance of rejecting a hypothesis that is false. 

A useful property of a test of significance is that it exerts a sobering 
influence on the type of experimenter who jumps to conclusions on 
scanty data, and who might otherwise try to make everyone excited 
about some sensational treatment effect that can well be ascribed to the 
ordinary variation in his experiment. On the whole, however, tests of 
significance are less frequently useful in experimental work than con- 
fidence limits. In many experiments it seems obvious that the dif- 
ferent treatments must have produced some difference, however small, 
in effect. Thus the hypothesis that there is no difference is unrealistic: 
the real problem is to obtain estimates of the sizes of the differences. 

The construction of confidence limits may add something to the in- 
terpretation of a test of significance. For instance, suppose that the 
speeds of the 2 machines in the example had not been significantly dif- 
ferent. It is a commonplace that this result would not prove that the 
2 machines were identical in speed. However, if the 95% confidence 
limits for the difference in speed were —2 and +4 seconds, we might 
argue that a true difference of 4 seconds, even if it existed, would be of 
no practical significance. Consequently, it could be said that for all 
practical purposes the 2 machines are identical in speed. This is much 
more positive and useful than the mere statement that the difference in 
speeds was not statistically significant. Tf, on the other hand, the con- 
fidence limits were —30 and +32 seconds, there is no justification for the 
conclusion that the machines can be regarded as equivalent. All that 
we have learned is that the data are not sufficiently accurate to show 
whether there is a difference in speed that is of practical importance. 

- To summarize: variability in results is typical in many branches of 
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experimentation. Because of this, the problem of drawing conclusions 
from the results is a problem in induction from the sample to the popula- 
tion. The statistical theories of estimation and of testing hypotheses 
provide solutions to this problem in the form of definite statements that: 
have a known and controllable probability of being correct. These 
statements are specific enough to be useful in deciding whether action 
can be taken on the basis of the results. 


1.13 The Function of Randomization. As would be expected, the type 
of statistical inference that can be made from a body of data depends on 
the nature of the data. It is easy to conduct an experiment in such a 
way that no useful inferences can be made, and many of the experiments 
brought to the statistician, particularly in earlier years, have been of this 
type. To take a simple example, suppose that in the comparison of the 
calculating machines each sum of squares had been computed first on 
machine A and then on machine B. Now it is quite possible that in- 
creased familiarity with the data will enable the second computation to 
be done faster than the first. The advantage is unlikely to be great in 
such an easy calculation; it could be so if the computation were more 
difficult. (In the experiment this advantage was estimated at about 4 
seconds, though the confidence limits for the true advantage are rather 
far apart.) 

If the experiment is conducted in this way, the observed difference in 
speed (B — A) is an estimate of the true difference, plus the unknown 
difference in speed between a second calculation and a first. Confidence 
limits set up by statistical techniques apply not to the true difference 
between B and A but to the true difference plus this unknown advantage. 
Consequently, the limits tell us nothing definite about the true difference 
between B and A. The interpretation of tests of significance also be- 
comes confused. If A is found significantly faster than B by ordinary 
statistical tests, we can be confident that there is a real difference in 
favor of A, since A was handicapped in the course of the experiment. 
But if B is found significantly faster than A, we do not know what to 
conclude. In this case we are dealing with a bias whose nature can be 
anticipated before the experiment has started. In other experiments 
where less is known about the type of variability that is present, similar 
biases that are quite unexpected can occur from some apparently in- 
nocuous rule about the way in which different treatments are handled. 

In order to avoid these biases we need some means of insuring that 4 
treatment will not be continually favored or handicapped in successive 
replications by some extraneous source of variation, known or unknown. 
This is done by the device known as randomization, due to Fisher. In- 
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stead of performing every calculation first on machine A, we apply the 
principle of randomization by tossing a coin to determine whether A or 
B shall be used first in any trial. The decision is made independently 
in each trial. The effect is that in any trial each machine has an equal 
chance of being tested under the more favorable conditions. Of course, 
the result of any specific randomization may favor one or the other 
treatment. But this happens only to an extent that is allowed for in the 
calculations that are used for tests of significance and confidence limits. 
This important result has been illustrated in detail by Fisher (1.2), who 
has shown how tests of significance and confidence limits can be con- 
structed, using only the fact that randomization has been properly ap- 
plied in the experiment. Randomization is one of the few character- 
istics of modern experimental design that appears to be really modern. 
One can find experiments made 100 or 150 years ago that embody the 
principles that are now regarded as sound, with the conspicuous ex- 
ception of randomization. 

The occasions on which randomization is required vary with the type 
of experiment and must be left to the judgment of the experimenter. 
One occasion arises when the treatments are allotted to the experimental 
material. Suppose that the effects of different diets on the heights and 
weights of children are to be ascertained. Since different children grow 
at different rates, a treatment that happens to be assigned to a group of 
fast-growing youngsters is favored. Consequently, we allot the diets in 
any replication at random to the children who are to receive them, with 
a new random allotment in each replication. Similarly, if 4 different 
oven temperatures for the cooking of roasts are under comparison, the 
4 temperatures are assigned at random to the 4 roasts which form the 
material for any replication. Sometimes this is the only randomization 
required, but frequently other operations that are carried out in the 
course of the experiment are also potential sources of bias. With a 
repetitious operation the order of events may be important, either be- 
cause a learning*process is involved which tends to make later operations 
better than the earlier ones, or because fatigue tends in the opposite 
direction. Systematic biases may be guarded against by randomizing 
the order in which the operation is performed on the different treatments 
in a replication. In other cases the equipment that is used introduces 
variation. For example, if 4 ovens are available to cook the 4 roasts, 
we would not always use the same oven for the same temperature, in 
case biases should be introduced because of systematic differences among 
the ovens. Instead, the temperatures could be assigned at random to 
the ovens in each replication. Thus we have ‘wo randomizations, one 
to assign the temperatures to the roasts and one to assign them to the 
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ovens. If, however, we decide, before randomizing, which roast is to go 
in which oven, the two randomizations can be reduced to one. This 
method can always be used, if convenient, to cut down the number of 
randomizations that must be made. 

Randomization is somewhat analogous to insurance, in that it is a 
precaution against disturbances that may or may not occur and that 
may or may not be serious if they do occur. It is generally advisable to 
take the trouble to randomize even when it is not expected that there 
will be any serious bias from failure to randomize. The experimenter is 
thus protected against unusual events that upset his expectations. Of 
course in experiments where a great number of physical operations are 
involved, the application of randomization to every operation becomes 
time consuming, and the experimenter may use his judgment in omitting 
randomization where there is real knowledge that the results will not be 
vitiated. It should be realized, however, that failure to randomize at 
any stage may introduce bias unless either the variation introduced in 
that stage is negligible, or the experiment effectively randomizes itself. 


1.14 Restricted Randomization. We now reconsider the randortniza- 
tion proposed for the experiment on computing machines, in order to 
discuss a criticism that presumably has occurred to the reader. When 
this experiment is being planned, we possess the knowledge that there is 
some advantage to the machine on which the second calculation of any 
sum of squares is made. In the light of this knowledge, it seems wise to 
make sure that each machine receives the advantage in five of the 10 
replications, rather than to leave this to be decided by the tossing of 
coins. In fact, if the decision is made by tossing coins, the chances are 
only about 1 in 4 that each machine will be used first in five replications; 
they are also about 1 in 4 that one machine will be used first in 7 replica- 
tions. The criticism is that the randomization is unlikely to give as ac- 
curate a comparison as the proposed alternative. The criticism is sound, 
and the experiment was actually planned so that each machine would 
be used first in half the replications. 

Randomization was also applied in the design actually used, though it 
is a different type from that originally proposed, being subject to the 
restriction that each machine must appear first exactly five times. The 
process of randomizing therefore consisted in choosing five numbers at 
random from the numbers 1 to 10; these turned out to be 1, 3, 6, 8, and 9. 
Accordingly, machine A was used first in replications 1, 3, 6, 8, and 9. 
This is one of the cases where it is difficult to see why failure to random- 
ize could have led to any serious danger of bias. It probably would have 
been quite satisfactory to use machine A first in the first 5 replicates, and 
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machine B first in the remainder. Nevertheless, it might happen that 
the advantage in the second calculation of a sum of squares would 
diminish in later replications; and, to save the trouble of trying to guess 
whether biases of this type are likely, the randomization was used as a 
precaution. 

The example represents a fairly common situation, in which knowledge 
of the variation that will affect the results is sufficiently detailed so that 
the simplest type of randomization is objectionable on the grounds of 
accuracy. Most of the designs in this book were constructed for this 
situation. By suitable restrictions on the randomization, these designs 
enable the experimenter to utilize any knowledge that will increase the 
precision of the experiment. In addition, each design allows sufficient 
randomization to prevent biases from sources of variation about which 
knowledge is less certain. For each of the principal designs described, 
the appropriate method of randomization is also presented. 

One further point should be mentioned. We have seen that the ex- 
periment on computing speeds could have been carried out either by 
means of the simple randomization originally proposed or by means of 
the restricted randomization which ensures that each machine is used 
first five times. Both methods provide a valid test of significance and 
valid confidence limits. However, the calculations made for the test of 
significance and for the confidence limits are different in the 2 cases. 
The first experiment is of the type known as randomized blocks, and 
calculations are made as described in section 4.23; the second experi- 
ment is of the “cross-over” type, for which the calculations are given in 
section 4.42. This is an instance of the general rule that the way in 
which the experiment is conducted determines not only whether in- 
ferences can be made, but also the calculations required to make them, 
The experimenter must make sure that the calculations which he uses 
are appropriate for the experiment. 


1.2 Initial Steps in the Planning of Experiments 


1.21 Importance of the Initial Steps. It has been mentioned that 
statisticians are often asked for advice in making inferences from the 
results of experiments. Since the inferences that can be made depend on 
the way in which the experiment was carried out, the statistician should 
request a detailed description of the experiment and its objectives. It may 
then become evident that no inferences can be made or that those which 
can be made do not answer the questions to which the experimenter had 
hoped to find answers. In these unhappy circumstances, about all that 
can be done is to indicate, if possible, how to avoid this outcome in future 


10 INTRODUCTION 1.21 


experiments. Consequently, it has come to be realized that the time to 
think about statistical inference, or to seek advice, is when the experi- 
ment is being planned. 

Participation in the initial stages of experiments in different areas of 
research leads to a strong conviction that too little time and effort is put 
into the planning of experiments. The statistician who expects that his 
contribution to the planning will involve some technical matter in sta- 
tistical theory finds repeatedly that he makes a much more valuable 
contribution simply by getting the investigator to explain clearly why 
he is doing the experiment, to justify the experimental treatments whose 

* effects he proposes to compare, and to defend his claim that the com- 
pleted experiment will enable its objectives to be realized. For this 
reason the remainder of this chapter is devoted to some elementary 
comments on the subject of planning. These comments are offered with 
diffidence, because they concern questions on which the statistician has, 
or should have, no special authority, and because some of the advice is 
so trite that it would be unnecessary if it were not so often overlooked. 

It is a good practice to make a written draft of the proposals for any 
experiment. This draft will in general have three parts: (i) a statement 
of the objectives; (ii) a description of the experiment, covering such 
matters as the experimental treatments, the size of the experiment, and 
the experimental material; and (iii) an outline of the method of analysis 
of the results. 


1.22 The Statement of Objectives. This statement may be in the 
form of the questions to be answered, the hypotheses to be tested, or the 
effects to be estimated. The aim should be to make the statement lucid 
and specific. The most common faults are vagueness and excessive 
ambition, in the sense that a twenty-year research program would be 
required in order to realize the stated objectives. Often it is advisable 
to classify the objectives as major and minor, because some types of 
experiment give high precision for certain treatment comparisons but 
low precision for others. When the experiment represents a cooperation 
among people of different interests, this classification is particularly 
helpful in that it makes clear which objectives take priority and helps 
to avoid an unhappy compromise that is adopted in the hope of pleasing 
everybody. 

The statement should include an account of the area over which gen- 
eralizations are to be made, or in other words, of the population about 
which it is hoped to make inferences. If an experiment is to be con- 
ducted on persons suffering from some disease, are the results presumed 
to apply to the patients in a specific hospital, to patients in all hospitals, 
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or to all sufferers whether in hospital or not? This sort of question is 
crucial in applied research, where often the experimenter has some 
definite population in mind to which he would like to apply the results. 
It is obvious that worth-while inferences about an extensive population 
cannot be made from a single experiment. For instance, inferences made 
from the experiment on the two computing machines are restricted to 
the person who made the calculations and to the type of calculation 
made. There is no guarantee that the results would be the same for a 
different type of calculation or with different operators. Consequently, 
if the object were to find out which machine is the speedier in general 
use in a computing laboratory, the experiment that was made has only 
scratched the surface of the problem. 


1.23 The Experimental Treatments. We have used the term ¢reat- 
ments to denote the different procedures whose effects are to be measured 
and compared. In the selection of treatments it is important to define 
clearly each treatment and to understand the role that each treatment 
will play in reaching the objectives of the experiment. Some issues that 
arise in particular cases are as follows. 

1. Confusion is sometimes caused by failure to distinguish whether 
the object is merely to “spot the winner’’ among the different treatments, 
or whether in addition it is desired to find some clues as to why the 
treatments behave as they do. A good example is the experiment which 
demonstrates, or so we are informed, that each of the three treatments, 
whiskey and water, gin and water, and rum and water, taken orally in 
sufficient quantities, produces some degree of intoxication. By itself the 
experiment provides no information as to whether the intoxication is due 
to the water, to the other ingredient, or to the fact that the two are 
mixed, A more extensive experiment with additional treatments would 
be necessary to throw some light on this question. Although there are 
occasions when it is sufficient simply to discover which is the best of the 
treatments, experience suggests that even in strictly applied research 
progress is faster if experiments supply fundamental knowledge. Sim- 
ilarly, the criticism that a certain treatment should be omitted because 
it would not be used in practice is valid if the aim is to find the best of 
the “practical” treatments, but it is not valid if the “impractical” 
treatment will supply some needed information about the behavior of 
other treatments in the experiment. = 

2. The specification of the treatments may raise difficult questions 
about the conditions under which the treatments are to be compared. 
Suppose that the object is to find out the effect of an application of 
sulphate of ammonia on the yield of some crop. It is well known that 
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this effect is likely to depend on the amounts of other important plant 
nutrients that are available to the crop. Consequently, it must be 
decided whether to add these nutrients so that they will be in abundant 
supply on all plots, or to test the nitrogen on the soil as it happens to be. 
The decision must be guided by the objectives of the experiment, and 
more particularly by the type of population to which inferences are to 
apply. Sometimes it is advisable to test the nitrogen in both the pres- 
ence and the absence of the other nutrients. One result of this decision 
will be that the experiment becomes also a test of the other nutrients, 
even though this may not have been part of the original plan. Experi- 
ments of this type, where one factor is tested over different levels of 
another factor, are known as factorial experiments and have come to play 
a prominent part in experimentation. They are discussed more fully in 
chapter 5. 

3. In some cases it becomes apparent that the treatments that can 
be tested in practice are’not those that we would like to test. The follow- 
ing example is typical of many fields where human relations are involved. 
Suppose that it is desired to compare two methods A and B of teaching a 
foreign language, and that we have defined each method clearly and 
have decided how to measure the success achieved by each method. 
However, some of the teachers who are to participate in the test already 
use a method similar to method A and have strong beliefs that method 
B is inferior, whereas others already follow method B and have no use 
for method A. If the teachers are divided at random into the groups 
that are to teach the two methods, we may expect that some teachers 
will be assigned to teach a method in which they have no faith, and we 
may decide that this is not the kind of comparison that we had in mind. 
On the other hand, if we forsake randomization and allow teachers to 
use the method that they like, differences attributed to the methods may 
in fact be due to differences in the ability of the two groups of teachers. 
Given enough resources, the situation can be more thoroughly explored 
by having each method taught by those who like it, those who do not, 
and those who are neutral, so that there are six treatments instead of 
two. In practice, the resources may not permit this elaboration, and 
the issues that must be faced are how best to use the resources available, 
and whether the experiment, as it can be done, is worth doing. Here 
again, the deciding factor is usually the use to which the results are to be 
put. 

4. Discussion may arise as to the need for a “control,” this term being 
used rather loosely for a treatment in which we are not particularly in- 
terested, but which may be needed in order to reveal, by comparison, 
whether other treatments are effective. Suppose that we wish to com- 
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pare the effectiveness of three treatments that are qualitatively similar, 
for example, three nitrogenous fertilizers all of which supply the same 
amount of nitrogen. The control would be a ‘‘no-nitrogen” treatment. 
Three cases may be distinguished. (i) The effectiveness of this type of 
treatment has been consistently demonstrated previously, and it re- 
mains to discover which of the three qualities is best. There is no need 
for a control. (ii) The type of treatment is in general effective, but oc- 
casionally the conditions of the test are such that it is not. For ex- 
ample, nitrogenous fertilizers may fail to produce responses in fields 
where the soil fertility is very high. In this case it will be well to add a 
control, which serves primarily to describe the conditions under which 
the fertilizers were tested. (iii) It may not be known whether the type 
of treatment is effective. The control should of course be included, and 
there may be a case for giving it more replication than the other treat- 
ments receive. An experiment of this type is analyzed in chapter 3, 
where the treatments were 4 types of soil fumigant, and the control (no 
fumigation) was replicated 16 times as against 4 for the other treatments. 
The effect is to obtain a more accurate estimate of the average response 
to fumigants, at the expense of some loss in accuracy in the comparisons 
among the fumigants. 

An interesting example of the illumination that is sometimes produced 
by the inclusion of a control has been reported by Jellinek (1.8). A 
headache remedy contained 3 ingredients, a, b, and c. In order to test 
whether ingredients b and ¢ were necessary, the complete mixture (abe) 
was compared with (ac) and (ab). There were 199 subjects, each of 
whom was treated with each drug for a two-week period, the appropriate 
drug being given whenever the subject complained of a headache. Sue- 
cess was measured by the ratio of the number of headaches relieved to 
the number tested in the two-week period. The mean success rates were 
0.84 for (abc), 0.80 for (ac), and 0.80 for (ab); these figures show very 
little overall difference in effectiveness. 

The experiment also contained a “control’”—an inert mixture which 
looked the same as the others but had no active ingredients. This was 
tested under the same conditions as the 3 drugs. No less than 120 sub- 
jects, or about 60%, reported at least some of their headaches relieved 
by the control, a result that is itself of some interest. Further, the 
control enables us to isolate the group of 79 subjects whose headaches 
were not relieved by an inert mixture. For this group, the mean success 
rates were 0.88 for (abc), 0.67 for (ac), and 0.77 for (ab), and could be 
shown to be significantly different. As the author comments: “Banal as 
it may sound, discrimination among remedies for pain can be made only 
by subjects who have a pain on which the analgesic action can be tested.” 
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Tf a control is required, it must be an integral part of the experiment 
so that results for the control are directly comparable with those for the 
other treatments. This point tends to be overlooked in experiments 
with human subjects when it is difficult or troublesome to assemble the 
desired number of subjects. For example, if a new drug is to be tested 
in some ward of a hospital, the recovery rate in the ward before the drug 
was introduced is not a satisfactory control, nor is the recovery rate in a 
different ward where patients happen to be receiving the standard drug. 
An observed difference between the effects of the new and the standard 
drug might be due to differences in the severity of the disease or in the 
type of patient or in other aspects of the medical care in the two time- 
periods or the two wards. It is necessary to regard the new and the 
standard drug as two experimental treatments on an equal footing, and 
to use randomization, or one of the methods of restricted randomization, 
in assigning the two drugs to patients. 


1.24 Further Details. Other features of the experiment that should 
be included in the draft of the proposals are the number of replications, 
the types of experimental material to be used, and the measurements 
that are to be made. Since these primarily affect the accuracy of the 
experiment, discussion of them is deferred to chapter 2, which is devoted 
to the general question of accuracy. 

Finally, the draft should describe in some detail the proposed method 
for drawing conclusions from the results. This is the section that is 
most frequently omitted, though it is a valuable one. It may include a 
sketch of the analysis of variance, an indication of the tabular form in 
which results will be shown, and some account of the tests of significance 
to be made and the treatment differences that are to be estimated. In 
this process we verify which treatment comparisons are relevant to 
each of the stated objectives of the experiment. Attention is drawn to 
deficiencies in the set of treatments and to treatments that supply no 
information essential to the purpose of the experiment. 
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CHAPTER 2 


METHODS FOR INCREASING THE ACCURACY OF 
EXPERIMENTS 


2.1 Introduction 


The results of experiments are affected not only by the action of the 
treatments, but also by extraneous variations which tend to mask the 
effects of the treatments. The term experimental errors is often applied 
to these variations, where the word errers is not synonymous with 
“mistakes,” but includes all types of extraneous variation. Two main 
sources of experimental errors may be distinguished. The first is in- 
herent variability in the experimental material to which the treatments 
are applied. We shall use the term experimental unit to denote the group 
of material to which a treatment is applied in a single trial of the ex- 
periment. The unit may be a plot of land, a patient in a hospital, or a 
lump of dough, or it may be a group of pigs in a pen, or a batch of seed. 
It is characteristic of such units that they produce different results even 
when subjected to the same treatment: these differences, whether large 
or small, contribute to the experimental errors. The second source of 
variability is lack of uniformity in the physical conduct of the experi- 
ment, or in other words, failure to standardize the experimental tech- 
nique. 

Neither the presence of experimental errors nor their causes need 
concern the investigator, provided that his results are sufficiently ac- 
curate to permit definite conclusions to be reached. In many fields of 
research, however, with the time and labor that can be given to an ex- 
periment, the results are so greatly influenced by experimental errors 
that only treatment differences that are large can be detected, and even 
these may be subject to considerable uncertainty. As a consequence, 
the investigation of methods for increasing the accuracy of experiments 
has played a prominent part in experimental research in recent years. 

The methods may be classified broadly into three types. The first is 
to increase the size of the experiment, either through the provision of 
more replicates or by the inclusion of additional treatments. The second 
is to refine the experimental technique. Thirdly, we may handle the 
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experimental material so that the effects of variability are reduced. 
This may be done by careful selection of the material, by taking ad- 
ditional measurements that provide information about the material, or 
finally by skillful grouping of the experimental units in such a way that 
the units to which one treatment is applied are closely comparable with 
those to which another treatment is applied. These various methods are 
discussed in more detail in succeeding sections. 

At this point it is advisable to discuss briefly the concepts accuracy and 
precision. Although these words are regarded as roughly synonymous - 
in most dictionaries, they have sometimes been assigned technical mean- 
ings, particularly in the natural sciences and engineering. The difference 
between the concepts may be illustrated by an experiment in which 
children are weighed on a machine that has a bias of 1lb, That is, if the 
true weight of a child is 46 Ib., repeated weighings of the child give read- 
ings which vary around 47 lb. The accuracy of a measurement of the 
weight of the child signifies the closeness with which the measurement 
approaches the true weight, 46 lb. The term precision, on the other 
hand, is concerned merely with the repeatability of the measurements. 
Thus the precision of a measurement denotes the closeness with which 
the measurement approaches the average, 47 Ib., of a long series of 
measurements made under similar conditions. It is clear that, if the 
bias is large, a measurement may be of high precision but of low ac- 
curacy.* 

Most of the devices mentioned in this chapter, including replication, 
additional measurements, and skillful grouping of the experimental ma- 
terial, operate on the precision of the experiment. In so far as the 
method of measurement remains biased, these procedures do not affect, 
this bias. A refinement in technique may reduce it. Other things being 
equal, an increase in precision is accompanied by an increase in ac- 
curacy, though if a large bias is present a substantial increase in pre- 
cision may result in a trivial increase in accuracy. 

For simplicity in presentation, we shall sometimes speak as if no bias 
is present, that is, as if precision and accuracy are identical. Thus the 
phrase “true difference between two treatments” will mean the true 
difference as recorded by the measuring device that was actually used. 
This will be identical with the natural concept of a “true difference” 
only if the measurements are not biased in favor of one of the treatments. 


* For a discussion of the difficult problem of measuring precision and accuracy 
see Shewhart (2.8). 
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2.2 Number of Replications 


Whatever the source of the experimental errors, replication of the 
experiment steadily decreases the error associated with the difference 
between the average results for two treatments, provided that pre- 
cautions (such as randomization) have been taken to ensure that one 
treatment is no more likely to be favored in any replicate than another, 
so that the errors affecting any treatment tend to cancel out as the 
number of replications is increased. The rate at which the experimental 
error is reduced is predictable from statistical theory. The basic quan- 
tity used to measure experimental errors is the error variance per ex- 
perimental unit, which is defined as the expected value of the square 
of the error that affects the observation for a single experimental unit. 
The square root of this quantity is called the standard error per unit. If 
o” is the error variance per unit and there are r replications, the error 
variance of the difference between the means for two treatments is 
20°/r, and the corresponding standard error is V2¢2/ Vr. This result 
is valid unless increased replication necessitates the use of less homo- 
geneous experimental material or of a less careful technique. 

In succeeding sections some advice is given on the choice of the number 
of replications for an experiment. ‘To present realistic advice is not 
easy. Often the size of an experiment is limited by lack of resources 
or by the conflicting claims of other experiments. However, even if it is 
realized that an experiment must fall short of the precision desired, it is a 
good practice to try to estimate the degree of precision that will be at- 
tained and to present this information as part of the proposal for the 
experiment. 


2.21 Number of Replications for Tests of Significance. Consider the 
difference between the effects of a pair of treatments, which might be a 
standard procedure and some new procedure which it is hoped will prove 
better than the standard. The precision desired in an experiment may 
be specified in either of two ways. We may specify the size of the true 
difference which the experiment is to detect by means of a test of sig- 
nificance, or we may specify how closely we wish to estimate the true dif- 
ference, by stating the width of the confidence interval that we would 
like to have for the true difference. 

Although the specification used is to some extent a matter of individual 
taste, the approach by means of a test of significance is helpful mainly 
in the initial stages of a line of research. The reasoning in this approach 
is roughly as follows. If it can be established that the new procedure is 
superior to the standard procedure by at least some stated amount, say 
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20% of their mean, then we will have discovered a useful result. On 
the other hand, if the first experiment shows no significant difference 
between the two treatments, we will be discouraged from further in- 
vestigation of the new treatment. Consequently, the first experiment 
must be large enough to ensure that if the true difference is 20% or more, 
it is highly probable that we will obtain a significant difference. Note 
that in this approach we do not insist that the first experiment give a 
precise estimate of the true difference, but merely that it give a signifi- 
cant result at the chosen level of significance. 

The probability of obtaining a significant result depends on the true 
standard error o per unit, the number of replications 7, and the number 
of degrees of freedom n that the experiment provides for estimating the 
error variance. The exact calculation of the probability is rather com- 
plicated; tables have been given by Neyman et al. (2.1) and others. The 
argument used in the following example, though logically faulty, leads 
to an approximation that is good enough for most practical purposes. 

Example. The true standard error per unit is 12% of the mean of the 
experiment, and there are 8 replicates and 28 d.f. for the estimate of 
error variance. What is the probability of obtaining a significant result 
when the true difference is 20%? 

Let d be the observed difference and 6 the true difference between the 
mean of the new treatment and that of the standard treatment. The 
true standard error of d is 


= fs in, - 0% 


In practice we do not know a, and in its place we use the estimated 
standard error per unit s, obtained from the analysis of variance of the 
results. We will assume that s happens to be equal to . 

The test of significance of d is made by means of a Student t-test, with 
28 d.f. To find the value that d must attain in order to be significant, we 
multiply the standard error of d, 6%, by the significant value of ¢. 
Suppose that the test is made at the 5% significance level, and that it is 
a one-tailed test.* The 10% value of ¢ in the standard tables, for 


* A one-tailed test is used when it is known that the new treatment must be at 
least as good as the standard procedure. The difference d is declared significant 
only if d is sufficiently large and positive; if d is negative we ascribe the result, to ex- 
perimental errors, since we do not entertain the possibility that the new treatment 
could be inferior. A two-tailed test is made when we do not know which treatment 
is better; Jarge values of d, whether positive or negative, are declared significant. 
All standard tables of the significance levels of t are computed for two-tailed tests. 
For a one-tailed test we read the value of ¢ corresponding to twice the significance 
probability. 
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28 d.f. is 1.701, so that if d is to be significant we must have 


2 
d= ea = (6)(1.701) = 10.206 
r 


The problem is to find the probability that this happens. 
The quantity 


follows the ¢-distribution with 28 df. But it is evident that 


d—20_ 10.206 — 20 
= py = 


d > 10.206 if and only if t& = gue ; 1.632 


Hence the probability which we seek is the probability that a value of 
t, with 28 d.f., should exceed —1.632. In the table of ¢, the probability 
that a value lies outside the limits +1.632 is about .11. The proba- 
bility wanted is therefore [1 — (14)(0.11)], or about .945. It is highly 
likely that a significant difference would be found in this experiment. 

The flaw in the argument is the assumption that the value of s can 
be assumed equal to c, whereas s is actually a random variable. If the 
argument is carried through algebraically, it gives the following rule for 
calculating the approximate probability P of obtaining a significant re- 
sult. 

Step 1. Find tg from the relation 


2 
i= Pets + te) (2.1) 


where 6 = true difference that it is desired to detect. 
o = true standard error per unit. 
r = number of replications. 
t, = significant value of ¢ in the test of significance (with care to 
distinguish between one-tailed and two-tailed tests). 
Step 2. 
P=1-—(4)pe (2.2) 


where 2 is the probability corresponding to fy in the ordinary t-table. 
The degrees of freedom in ¢; and fy are those available for the estimate of 
error variance. 
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TABLE 2.1 NuMBER OF REPLICATIONS REQUIRED FOR A GIVEN PROBABILITY OF 
OBTAINING A SIGNIFICANT RESULT 


Upper figure: Test of significance at 5% level, probability 80% 
Middle figure: Test of significance at 5% level, probability 90% 
Lower figure: Test of significance at 1% level, probability 95% 


One-tailed tests 
‘True differ- True standard error per unit (c) as percent of the mean 
ence (8) as 
percent of 
the mean | 2| 3 4) 5 6] 7 8] 9 10] 11 12] 14 16/18 20 
5 8} 6 9|18 19] 25 33) 41 60 
4) 7 12]18 26) 35 45 
7| 18 22) 33 47 
10 2| 2 3] 4 6] 7 9] 11 18] 16 19) 25 33/41 50 
2} 3 4| 5 8| 9 12] 15 18] 22 26/35 45 
8| 4 7| 9 18|17 22) 27 33 | 40 47 
15 S/F Bi 8 S| &. Bi) & 7) 8-8} ae 416 | 1a, 23 
2|2 3] 3 4| 5 6| 7 9/10 12] 16 21) 26 31 
2| 3 4] 5 7] 8 10] 18 15] 18 22] 29 37 | 47 
20 BAe aes eae Kak el BG Ae IP aL) As 
2|2 2) 2 8] 8 4] 5 5| 6 7} 9 12/15 18 
2| 3 3] 4 4] 5 7) 8 911 13)17 22/27 33 
25 PRR We 4 ie Pe 2) (ee ome Ve J ly Cie) 
Dl Se Bl 2. PIS Bi 8. 48) bs) 7 8) tO a 
Seo. SNS Bi £ wh) 6 eT eB Oe wes 22 
30 eee el Ue a eM a) tS Sp 
Bl Dts Nee of Bi] Die Bl By BB 4 Ah BSL! Niet 19 
BS DNS  SifeBy Aw 6B) BE | see a0 es 


The rule may be inverted in order to give the number of replications 

r required for a given probability P of obtaining a significant result. In 

this form we have : 
o 

r>2 () (4 + te)? (2.3) 


where 5 = true difference that it is desired to detect. 
o = true standard error per unit. 
4, = significant value of ¢ in the test of significance. 
to = value of ¢ in the ordinary table corresponding to 2(1 — P). 


——— ee 
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TABLE 2.1 (Continued) 
Two-tailed tests 


True differ- ‘True standard error per unit (c) as percent of the mean 
ence (6) as 
percent of 
the mean |2/ 3 4 56 6/7 8&8] 9 10] 11 12/14 16] 18 20 
5 4 8 AV 7) 2451182 a 
5] 9 15] 22 81 | 42 
7| 14 24 | 38 
10 2; 38 4/ & 7] 9 11] 14 17] 20 24] 32 41 
2} 3 &/] 7 9/12 15/18 22] 27 31] 42 
3/ 5 7/11 14/19 24] 30 37 | 45 
15 2); 2 3/ 3 4] 5 6] 7 8/10 11]15 19] 24 29 
2; 2 3/ 4 5] 6 7] 9 11] 138 15] 19 25] 81 389 
3/ 3 4/ 6 7] 9 12/14 17] 21 24] 33 42 
20 22) 21 8 ES Bie 45) BB Be FN Oa ay, 
2); 2 2/ 8 38] 4 5] 6 7| 8 9|12 15) 18 22 
2} 8 8) 4 5] 6 7] 9 11] 12 14/19 24) 30 37 
25 Dea ei ol BB Be eae) 61 Sa 
2 Be Bh 2 i) Bi 4 | 4) Bi]. BB) Bs 40" WB tb: 
2) 2 3) 38 4] 5 5] 6 7] 9 10] 18 16) 20 24 
30 By ee EB BA "S| BY Bi a a: Beigua as 
See? 2) Be 2 Be BL) 6B 4 ae B8 "7 ne ae 
oe) foe’ Jost al Uy: Jaleo ee: Sime al OC am Py Ki em fl ek Pm 2 PY Be 


Notes. In constructing the table, it was assumed that the number of degrees of 
freedom for error is 3(r — 1); this would apply in a randomized blocks experiment 
with 4 treatments. 

No entries are given where more than 50 replications are required. 


The use of this equation is slightly tricky, because the number of 
‘degrees of freedom in é and fg depends on r. Trial and error may be used 
until the smallest r satisfying (2.3) is found. 

Example. In the previous example, how many replications are re- 
quired in order to have a chance of 4 in 5 that a significant result will be 
obtained? Here we have 5 = 20, « = 12, and we will suppose that the 
experiment has 4 treatments in randomized blocks, which gives n = 
3(r — 1) degrees of freedom for the estimate of error. In the ¢-table, 
t, is the value for probability .10, while f is the value for probability 
2(1 — .80) or .40. 
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In order to start the calculation some guess must be made about the 
value of n. It does not matter greatly if the guess is inaccurate; we will 
try n = 30. For this, 


t, = 1.697; ft = 854; sothat (4 + te) = 2.551 


Hence 
r > 2(1249)?(2.551)? = 4.7 


Consequently our second trial value isr = 5. Since this provides only 
12 instead of 30 d.f., we must repeat the calculation to verify whether 
5 replications are sufficient. For 12 d.f. (t + ta) is 2.655, Thus 


‘o\? 12\? 
2(-)¢ 2 = () 2.655)? = 5.0 
(2) ta +t? = 2(5) e055) = 508 


Since this is greater than 5, we conclude that 5 replications are not quite 
enough, and that 6 is the smallest number of replications that will 
satisfy the conditions. We can be sure without further computation 
that 6 will satisfy the conditions, because the right-hand side of the in- 
equality always decreases when we increase r. 

Table 2.1, computed by this method, shows the minimum number of 
replications for a range of values of 6 and o, and is intended to be used 
in the planning of experiments. Usually it is most convenient to think of 
6 and o expressed in percentages of the mean per observation for the 
experiment, and the range in the table has been constructed with this 
in mind. Since the value of r depends only on the ratio «/6, it does not 
matter what units are used for 5 and c, provided that they are the same 
for both. There are three entries for any given pair of values of 6 and o. 
These show, in descending order, the replications required for a test of 
significance at the 5% level and a probability .80 of getting a significant 
result, for a 5% level and probability .90, and for a 1% level and prob- 
ability .95. The top half of the table applies to one-tailed tests, the 
lower half to two-tailed tests, 

Construction of the table requires some assumption about the size of 
the experiment. In the case presented the experiment has four treat- 
ments in randomized blocks, so that n = 3(r — 1). The values in the 
table change little if the experiment happens to provide more than this. 
If the experiment has only two or three treatments it is advisable to 
check the value of 7 by means of the formula. 

In the practical use of the table it is necessary to estimate the value of 
the true standard error per unit. Often this can be done fairly well from 
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the results of previous experiments with the same kind of material. It 
should be realized that the probabilities to which the table applies will 
be correct only if the value of o is estimated correctly. If o is under- 
estimated, the probability of obtaining a significant result will of course 
be smaller than the presumed value. For cases where it is highly im- 
portant that the probability be correct, more exact methods are dis- 
cussed in section 2.23. 

As an illustration of the reading of the table, consider the case in the 
last example, where 6 = 20,0 = 12, P = .80, and the test is a one-tailed 
test at the 5% level. For this we read the top entry in the top half of 
the table for 6 = 20, « = 12, and find that 6 replications are necessary. 
From the middle figure we see that an increase to 7 replications in- 
creases the probability to .90. If the test is to be two-tailed, we read at 
the corresponding place in the lower half of the table, and find that 7 
replications are required for a probability .80 and 9 for a probability .90. 

The table brings out the value of any reduction in the standard error 
per unit. In fact, we cannot have a high probability of detecting a 5% 
difference with any reasonable number of replications unless the stand- 
ard error per unit is 4% or under. Differences of 20% or over can be 
detected in most cases without excessive replication. No entries have 
been inserted in the table for situations where more than 50 replications 
would be required, since experiments of this size are uncommon. 


2.22 Number of Replications for Prescribed Limits of Error. The 
confidence limits for 6, the true difference between the effects of two 


treatments, are 
2 
6=d+ , /-st (2.4) 
f, 


where ¢ is the value in the ¢-table corresponding to the confidence prob- 
ability chosen and to the number of degrees of freedom n in the estimate 
of error. The quantity Vv 2s1/ Vr may be called the limit of error, since 
it measures the maximum distance between d and 6 for the confidence 
probability chosen. Since the limit of error depends on s, it is not known 
until the experiment is completed. If s is replaced by its average or 
expected value, we obtain the expected limit of error 


Tes ft fot (2.5) 
r 


24 


The factor f enters because the average value of s is slightly less than o; 


1 
to a close approximation, f equals ( - x 
1. 


From relation (2.5) table 2.2 was constructed to show the numbers of 
replications required for various values of L and for confidence probabil- 


TABLE 2.2 NuMBER OF REPLICATIONS REQUIRED FOR GIVEN LIMITS OF ERROR IN 
THE DIFFERENCE BETWEEN TWO TREATMENTS 


Upper figure: Confidence probability .80 
Middle figure: Confidence probability .90 
Lower figure: Confidence probability .95 


METHODS FOR INCREASING ACCURACY 


2.22 


Limit of 
error (++) 
as percent 
of the mean 


10 


True standard error per unit (¢) as percent of the mean 


2/3 °4) 5 6) 7 8) 9 10; 11 12) 14 16.) 18 20 
3] 4 7) 10 14)19 24) 31 37 | 45 

4) 6 11] 16 23) 31 40] 50 

5] 9 15 | 23 32 | 48 

2} 3 4] 6 8)11 14] 18 21 | 26 81) 41 

3/ 4 6] 9 13/18 23] 29 35 | 42 50 

3] 6 9] 13 19] 25 32] 40 50 

2} 2 8] 4 6/] 7 9] 12 14/17 20] 27 35] 44 

2} 3 5] 6 9)12 15] 19 23 | 27 32) 44 

3/ 4 6| 9 12/16 21] 26 32/39 46 

2); 2 8] 3 4] 5&5 7] 8 10/12 14]19 24/31 38 
2) 3 4/ 5 6) 8 11] 13 16/19 23] 31 40} 50 

2) 3 5] 7 9/12 15] 19 23) 27 32] 48 

BBD) 2 BN Ba BB 7 28.) TL Aa ag! Bt 
2) 2 3) 3 4] & 6] 8 9/11 13)18 28)]:29 35 
2) 8 3] 4 6) 7 9) 11 13] 16 19] 25 82] 40 40 
Die Bie Bs) 22 |B eB aw AB BLY Ode. Ja 
2) 2 2) 3 3) 4 5) 5 6] 8 9) 12 15] 19 938 
2) 2 3) 8 4) 6 6) 7 9 / At 2'| 16° 21 }98) 32 


ities of .80, .90, and .95. As in table 2.1, the experiment was assumed to 
have 3(r — 1) degrees of freedom in the estimate of error; no entries were 
inserted where more than 50 replications are necessary, and the values 
of L and o are expressed as percentages of the mean for the experiment, 
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This table may be used in the planning of an experiment where the 
object is to estimate the difference in the effects of two treatments. Sup- 
pose that the true standard error per unit is expected to be about 6%, 
and that preliminary experiments have indicated that the true difference 
between the effects of the two treatments is around 25%. In this case 
we would probably not be satisfied with a value of Z any higher than 
5%; if L is 5% and if the observed difference is found to be 22%, we 
are confident that the true difference lies between (22 + 5)%, that is, be- 
tween 17 and 27%. From the entry for = 6, L = 5, we find that 6 
replications suffice for a confidence probability of 80%, 9 for 90%, and 
12 for 95%. 

It is important to reiterate that this table does not guarantee that the 
limits of error will be of the desired size, even if the value of o is fore- 
casted correctly. The actual limits will depend on s and may be either 
larger or smaller than Z. They will be about the desired size on the 
average if « is guessed correctly. Further, the table makes it clear that 
small treatment differences of the order of 5% cannot be accurately 
estimated in a single experiment, at least with the values of o and r that 
normally prevail in many lines of research. For an accurate estimate of 
a 5% true difference, we would probably not want L any larger than 1%. 
This value was not included in the table since it demands great numbers 


of replications. 


2.23 The Case Where Additional Assurance Is Desired. We have 
seen that tables 2.1 and 2.2 do not ensure that the desired standards of 
accuracy will always be attained. In table 2.1 the probability of obtain- 
ing a significant result is as stated only if the value of o is correctly 
guessed. In table 2.2, if o is correctly guessed, the limits of error will 
equal the desired L on the average, but for a single experiment may be 
either higher or lower. In certain circumstances, methods are available 
which provide a more definite assurance. 

If an initial estimate 8,” of 0”, based on n; degrees of freedom, has been 
obtained from preliminary data, Harris, Horvitz, and Mood (2.3) present: 
tables that give the number of replications required so as to be certain 
that the probability of obtaining a significant result will have the de- 
sired value. No guesswork about ¢ is involved, though if n, is small the 
number of replications required tends to be large so as to allow for the 
possibility that s,? is a very poor estimate of o?. With the same pre- 
liminary data a value of r may be found such that the probability is 
.05 that the limits of error exceed ZL. If in default of an estimate s,? we 
can give a range of values within which o? is likely to lie (e.g., from pre- 
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vious experiments), the authors suggest a simple method for using their 
tables. 

An alternative approach, due to Stein (2.4), is applicable when the ex- 
periment can be interrupted after a given number of replications. This 
approach ensures that the limits of error will not exceed L. The pro- 
cedure is as follows. We first complete a number of replications some- 
what less than the number which we expect will be necessary. At this 
stage we compute the limits of error. If these are already less than L, 
the object is achieved. If they are greater than L, Stein gives a simple 
rule for computing the additional number of replications needed to give 
limits that are less than L. 

Naturally, these methods require more replications than tables 2.1 
and 2.2. In fact, if used repeatedly, they give a standard of accuracy 
which on the average is greater than we desire, in order to ensure that 
the standard is seldom if ever less than what is wanted. For this rea- 
son, tables 2.1 and 2.2 appear to be suitable for ordinary experimenta- 
tion; the methods described here should be used when it is very impor- 
tant to reach a specified standard of accuracy. 


2.3 Other Methods for Increasing Accuracy 


2.31 The Measurement of Relative Efficiency. We have scen that 
the error variance of the difference between the means for two treat- 
ments is 207/r, where r is the number of replications. This result pro- 
vides a simple means for comparing, in concrete terms, the relative 
precision of two experiments. Suppose that one design gives a true error 
variance of 1,0 per experimental unit, while a second gives 0.5. The 
two experiments would give equally precise comparisons among the 
treatment means if the amount of replication in the first were twice that 
in the second. From a knowledge of the error variances per unit, com- 
parisons are thus easily made in terms of the relative amounts of replica~ 
tion required to attain the same degree of precision. The inverse ratio 
of the variances per unit is sometimes called the relative efficiency of the 
two designs; in the example above, the relative efficiency of the second 
experiment to the first is 2.0 or 200%. Similarly we might speak of the 
relative gain in efficiency as 1.0 or 100% in this case. These terms occur 
frequently in the literature on experimental designs. 

One further factor must be taken into account. For the same numbers 
of treatments and replicates, the number of degrees of freedom in the 
estimate of error changes with the design. This number enters into many 
of the uses to which the experimental results are put. When the number 
of degrees of freedom becomes smaller, the limits of error for a true 
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difference are increased and the probability of obtaining a significant re- 
sult is decreased; in other words, the sensitivity of the experiment is de- 
creased. 

Thus the numbers of degrees of freedom for error are relevant to the 
comparison of two designs. A change in design which decreases the error 
degrees of freedom as well as the error variance may not be advantageous. 
This leads to the question: by how much must the experimental error 
variance be reduced so as to balance a given reduction in the error degrees 
of freedom? The question has been discussed by Neyman eé al. (2.1), 
Fisher (2.5), and Walsh (2.6). The answer depends on the use to which 
the results are put. From one point of view, two experiments may be 
regarded as equal in sensitivity when the limits of error for a given con- 
fidence probability are equal. Alternatively, equal sensitivity may 
imply an equal probability of detecting as significant a given real dif- 
ference between the effects of two treatments. Other definitions could 
be set up. 

Table 2.3 illustrates the situation. An experiment with a true error 
variance per unit equal to 1 (i.e., with an infinite number of degrees of 
freedom) was chosen as the standard. The table shows, for a range of 


TABLE 2.3 Errecr OF NUMBER OF DEGREES OF FREEDOM IN ERROR ON THE SEN- 
SITIVITY OF THE EXPERIMENT. ERROR VARIANCES PER UNIT WHICH GIVE EQUAL 
SENSITIVITY ACCORDING 'TO FOUR DEFINITIONS 


Number of error degrees of freedom (n) 
Defini- 
tion 
1 2 3 4 5 6 8 10 | 15 | 20} 30] 
1 0.04 | 0.26 | 0.45 | 0.56 | 0.64 | 0.70 | 0.77 | 0.81 | 0.87 | 0.90 | 0.94 | 1.00 
2 0.27 | 0.59 | 0.72 | 0.79 | 0.83 | 0.86 | 0.90 | 0.92 | 0.94 | 0.96 | 0.97 | 1.00 
3 0.22 | 0.47 | 0.61 | 0.69 | 0.75 | 0.79 | 0.84 | 0.86 | 0.91 | 0.93 | 0.95 | 1.00 
4 0.10 | 0.40 | 0.57 | 0.67 | 0.73 | 0.78 | 0.83 | 0.87 | 0.91 | 0.93 | 0.96 | 1.00 


values of the error degrees of freedom (n), the true error variances per 
unit which are required to equal the standard in sensitivity according 
to four definitions of this term. In case 1 the average limits of error for 
a confidence probability .95 are made equal. In case 2 the same cri- 
terion is used for a confidence probability .80. In case 3 all experiments 
have the same probability .25 of detecting as significant a constant dif- 
ference between two treatments. Case 4 shows the same comparison for 


28 METHODS FOR INCREASING ACCURACY 2.31 


a larger constant difference where the probability is .75. The tests of 
significance are one-tailed ¢-tests at the 5% level. 

For example the figure 0.70 for n = 6 (definition 1) means that with 
a, true error variance per unit of 0.70 and 6 d.f. in the estimate of error, 
the limits of error for a confidence probability .95 are the same on the 
average as when o” is 1 and 7 is infinite. A 30% decrease in error vari- 
ance is needed to compensate for the lack of degrees of freedom. 

The wide divergence between the results for definition 1 and defini- 
tion 2 is noteworthy since the only change in criterion is that from a 
confidence probability .95 to one of .80. Definitions 3 and 4 exhibit 
much closer mutual agreement and give results intermediate between 
those of definitions 1 and 2. In practical applications the discrepancies 
among the four methods will be smaller than table 2.3 suggests, be- 
cause comparable designs for the same experiment seldom differ greatly 
in the degrees of freedom available for error. 

Fisher’s approach (2.5) is somewhat different. In effect, he calculates 
the “amount of information” which the estimated difference d between 
two treatment means supplies about the true difference 6, the calcu- 
lation being made from the distribution of the quantity 


Vr(d —8) 
V 2s 
He finds the information to be (n + 1)/(n + 3)s?, whereas if « were 


known exactly the information would be 1 /o”. The estimated variances 
which give equal amounts of information are shown in table 2.4. 


TABLE 2.4 EsrimMarep VARIANCES WHICH PROVIDE EQUAL AMOUNTS OF 
INFORMATION 


Number of error degrees of freedom (n) 


1 2 3 4 5 6 7 8 9 10 i 12 15 20 30 2 


0.500 | 0.600 | 0.667 | 0.714 | 0.750 | 0.778 | 0.800 | 0.818 | 0.833 | 0.846 | 0.857 | 0.867 | 0.889 | 0.913 0.939} 1 


This table agrees closely with the results for cases 3 and 4, except 
when n is below 5. For general purposes it is suggested that this table 
be used to take account of the differences in degrees of freedom for error 
in two designs that are being compared. Suppose that a design with 
n = 6 is compared with one in which n = 12. The former gives more 
information only if the error variance s;° is less than (0.778/0.867)s2”. 
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Thus the relative efficiency of the first design to the second is estimated 
as 0.778892/0.86781”, or more generally as 


(m + 1) (m2 + 3)so” 
(m2 + 1)(m + 8)81 
The adjustment is of importance only if m; and mz are small. 

This expression applies to the simple case in which the two experi- 
ments have the same set of treatments. The experiments may, however, 
have different sets of treatments, although each provides an estimate of 
some specific treatment effect in which we are interested, Alternatively, 
a different method of estimation of the treatment effect may have been 
employed in the two experiments. In such cases we may wish to assess 
the relative accuracy with which this treatment effect is estimated in 
the two experiments. The expression given above still applies, except 
that s,2 and s:” become the estimated variances of the treatment effect 
as found in the two experiments, while n; and ng are as before the num- 
bers of degrees of freedom in 312 and 89” respectively. 

In succeeding sections a number of other devices which affect the 
accuracy of an experiment are presented briefly. 


2.32 Selection of Treatments. In certain cases the selection of the 
treatments has a substantial effect on the precision of an experiment. 
As Fisher has stressed, striking gains in precision may be achieved by 
testing different types of treatment in the same experiment, instead of 
conducting a separate experiment for each type. For instance, one 
type of treatment or factor might be the depth of ploughing in the 
preparation of a field for a crop, while another might be the addition of a 
nitrogenous fertilizer. We might conduct one experiment in which 
deep (D) and shallow (S) ploughing are compared, and another in which 
no fertilizing (QO) is compared with the addition of a specified amount of 
fertilizer (N). In a factorial experiment both factors would be tested 
simultaneously by means of the 4 “reatments’—(DO), (SO), (DN ), and 
(SN), where the symbol (DO) implies deep ploughing with no addition 
of nitrogen, ete. The average response to nitrogen is assessed by com- 
paring the last two treatments with the first two. This comparison 
gives as precise an estimate of the average response to nitrogen as if the 
whole experiment had been devoted to that factor alone. The same 
property holds for the estimation of the average difference in effect be- 
tween deep and shallow ploughing. Thus the experiment, as it were, 
does double duty. The use of factorial experimentation is discussed 
more fully in chapter 5. 

There are many specialized problems where the choice of the proper 
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amounts of some ingredient is important. As a very simple example, 
suppose that the response to increasing amounts of the ingredient is 
known to be linear and that the purpose is to determine the slope of the 
line. The most accurate experiment contains only two amounts of 
ingredient. These should be placed at the ends of the range within which 
the response is linear and experimentation is feasible. With three 
amounts of ingredient, two at the ends and one in the middle, the 
variance of the estimated slope, for the same total size of experiment, is 
1} times as large, and if more amounts are used it becomes larger still. 
A more complex example occurs in the estimation of the amount of a 
virus in a solution by preparing a series of dilutions of the solution and 
counting the numbers of lesions produced on leaves which are rubbed 
with the dilutions (2.7). The accuracy of the estimate is known to 
depend both on the dilution ratio that is chosen and on the number of 
dilutions that are used. 


2.33 Refinements of Technique. Since technique is the responsibility 
of the experimenter, its importance need not be elaborated. The prin- 
cipal objectives of a good technique are as follows. 

i. To secure uniformity in the application of the treatments. In 
pig-feeding experiments, for instance, a uniform amount of food cannot 
be supplied to each animal without the provision of individual feeding 
boxes. In the testing of insect sprays, delicate apparatus is required 
in order to subject each batch of insects to a desired dose of spray. 

ii. To exercise sufficient control over external influences so that 
every treatment produces its effects under comparable and desired con- 
ditions. It is difficult to generalize about the degree of control needed . 
a balance must be struck between the cost incurred and the gain in 
precision obtained. The artificial production of diseases for experiments 
on resistance to infection exemplifies a case where experimentation can- 
not proceed rapidly without such control over external conditions. 

iii. To devise suitable unbiased measures of the effects of the treat- 
ments. Often the appropriate measurements are readily apparent; 
sometimes the development of a satisfactory method of measurement 
requires years of research, as in the estimation of certain of the vitamins, 
in soil analysis, and in sociological investigations. 

iv. To prevent gross errors, from which no type of experimentation 
seems to be entirely free. Adequate supervision and checking of the 
work of assistants and a scrutiny by the experimenter of the data from 
every experimental unit will go far towards the discovery and rectifica- 
tion of errors. 

Faulty technique may increase the real experimental errors in two 
ways. It may introduce additional fluctuations of a more or less 
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random nature. Such fluctuations, if they are substantial, should reveal 
themselves in the estimate of error as calculated in the analysis of 
variance (chapter 3). Where his estimated standard errors are con- 
sistently higher than those of other workers with similar material, the 
experimenter is advised to seek the reason, which may lie in differences 
in technique. In addition, faulty technique may result in measurements 
that are consistently biased. The estimate of error does not take ac- 
count of such biases, since it is derived from comparisons of the measure- 
ments with one another; in other words, it estimates precision rather 
than accuracy. The principal safeguards against such biases are care 
and skill in the construction and handling of measuring devices, plus the 
intelligent use of randomization. 

It is worth while to consider from time to time whether simplifications 
can be brought into the technique without undue loss of accuracy. 
Many experiments involve chemical determinations which vary little 
from unit to unit within the same treatment. For instance, in sugar- 
beet experiments, the experimental error of the amount of sugar per 
acre is due mainly to the error in the weight of roots per plot. The 
amount contributed by the sugar content percent is nearly negligible. 

With compound measurements of this type, the labor devoted to the 
less variable component ean be reduced with little loss of accuracy. If 
the chemical measurement is expensive, a considerable saving in cost 
may be made. Tests of significance of the chemical composition need 
not be sacrificed. For example, if the experiment has six replicates, a 
chemical analysis might be made for each treatment on bulked material 
from the first three replicates and another on bulked material from the 
last three replicates. These data provide a valid test of significance based 
on two “replicates.” 


2.34 Selection of Experimental Material. The choice of the experi- 
mental unit may be of importance. In the planning of field experiments 
numerous studies have been made of the variability among crop yields 
on plots of various sizes and shapes under uniform treatment. From 
these data the best size and shape are selected. The criterion should 
be to obtain the maximum accuracy for a given expenditure of time and 
labor. 

Frequently, uniform material is prepared specially for experimental 
purposes, as in the development of inbred lines by laboratories engaged 
in animal experimentation. Alternatively, the experiment may be con- 
fined to a sample chosen for its homogeneity from a large batch of ex- 
perimental material. If the results of the experiment are to be applied 
to unselected material, these types of specialization have potential dis- 
advantages. Responses obtained to the treatments in highly selected 
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material may not be the same as the responses in unselected material. 

Selectivity in the material is difficult to avoid where experiments are 
conducted in the field of economics or sociology. In testing some method 
of farm management, for instance, the experiment may require the active 
participation of a group of farmers. The success of such experiments 
depends greatly on the tact and resourcefulness of the investigator in 
persuading farmers to cooperate so that the participants are a represent- 
ative sample of the population about which generalizations are to be 
drawn. 


2.35 Additional Measurements. In the course of an experiment it 
may be possible to take supplementary measurements which predict at 
least to some extent the performances of the experimental units. In an 
experiment to measure the effects of different diets on the weights of 
children, their weights at the start of the experiment would be a supple- 
mentary measurement of this kind, since the increase in weight of a 
child during the experiment is probably correlated with his initial 
weight. 

By a technique known as the analysis of covariance, we may estimate 
from the data the extent to which the observations were influenced by 
the variations in these supplementary measurements. The average 
response to each treatment can then be adjusted so as to remove the 
experimental error arising from this source. In the school experiment, 
the adjusted responses to treatments represent approximately the re- 
sponses that would be obtained if all the children had the same initial 
weight. Thus the effects of variations in the initial weights are largely 
eliminated from the experimental error, without the necessity of equal- 
izing the initial weights in the planning of the experiment. 

Since the analysis is a statistical technique of comparatively recent 
origin (it was first presented by Fisher about, 1932), it may be unfamiliar 
to many investigators who could use it to obtain a substantial increase 
in precision with little effort. Its purpose is to remove experimental 
errors arising from extraneous sources of variation which it is impractical 
or too costly to control by a more refined technique. Cases are fairly 
common where the use of covariance has more than doubled the effi- 
ciency of the experiment. The extra work involved consists in taking 
the subsidiary measurements and in applying the adjustments to the 
treatment means. An example of the computations is given in section 
3.8. 


2.36 Planned Grouping: Complete Blocks. Finally, we may attempt 
to minimize the experimental errors through the choice of the experi- 
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mental plan. The prospects of increased accuracy by this means have 
been widely explored during the past twenty years. This book is in- 
tended as a source of reference to the numerous procedures that have 
been developed. 

The basic idea is simple. Consider an experiment of which a number 
of separate replications have been conducted. The experimental errors 
of the results from any replicate can arise only from sources of variation 
that affect the units within that replicate. Consequently, the error of 
the difference between two treatment means taken over a number of 
replicates must also arise solely from variations within the individual 
replications, Variations from one replicate to another do not contribute 
to the errors. In carrying out an experiment we utilize this simple fact 
by trying to control sources of variation that affect different units in the 
same replicate. We need not attempt to reduce differences among the 
replicates. For example, if the experimental units form a very heter- 
ogeneous batch, we attempt to group them so that units in the same 
replicate are similar; we do not worry if the units in one replicate are 
not at all similar to those in another. By this device, precise experi- 
ments can often be made from what at first sight appears an unpromising 
batch of material. Similarly, if a uniform experimental technique cannot 
be maintained throughout the experiment, the important point is to 
keep the technique uniform within a replication; changes should be 
made when moving from one replicate to another. This type of design, 
known in agriculture as randomized blocks, is discussed in section 4.2. 

The idea is carried a stage further in the latin square (section 4.3). 
The treatments are arranged diagrammatically so that each appears once 
in every row and once in every column of a square array. Variations 
among the groups of experimental units which correspond to the rows 
and also among those which correspond to the columns are eliminated 
from the experimental errors. For example, in a comparison of 5 dif- 
ferent ways of performing some manual operation, the rows may repre- 
sent 5 operators, each of whom carries out all the methods in turn. The 
columns, numbered from left to right, may prescribe the order in which 
each operator performs the 5 tasks. The plan of a latin square design 
for this experiment might read as follows. 


Order of procedure 


Operator (1) (2) @) (4) (5) 
I D A B E c 

Il A B c D E 

Til B E A c D 
Iv c D E B A 
Vv. E c D A B 
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The letters A, B, C, D, E represent the 5 methods. Thus operator I 
carries out method D first, method A next, and so on. With this ar- 
rangement, the experimental errors are unaffected by differences among 
the abilities of the men and also by systematic variations introduced by 
the order in which the methods are performed. If the opportunity 
presents itself, this type of control may include a third factor by the use 
of a graeco-latin square (section 4.5). 


2.37 Planned Grouping: Incomplete Blocks. In the randomized block 
design, the precision depends to some extent on the number of treat- 
ments. As this number increases, it becomes more difficult to keep the 
variations between experimental units within each replicate small, and 
the error variance per unit tends to increase. In experiments with a 
large number of treatments, numerous attempts have been made to 
avoid this loss of precision by the formation of groups of experimental 
units which do not contain all the treatments and therefore can remain 
small. These groups are called incomplete blocks. The groups are so 
constructed that we can still remove the effect of the differences among 
groups from the experimental errors, just as such differences are auto- 
matically removed in the randomized blocks design. With incomplete 
blocks, however, this removal usually requires the application of adjust- 
ments to the treatment means, so that the statistical analysis of the re- 
sults becomes more complex. 

The best method of constructing the incomplete blocks varies with the 
nature of the experiment. Sometimes certain comparisons among treat- 
ments are of greater interest than others. For instance, in a field experi- 
ment containing a number of early-maturing and an equal number of 
late-maturing varieties of a crop, comparisons between pairs of varieties 
which mature at about the same date will presumably be more important 
than comparisons between an early and a late variety. An incomplete 
block might contain either all the early or all the late varieties, a pair of 
blocks forming a complete replication. This arrangement, of which 
several variations are possible, is known as the split~plot design (section 
7.1). 

A similar situation occurs in factorial experiments. Certain com- ~ 
parisons, e.g., the average effects produced by varying one factor, are 
nearly always of immediate interest. Others, which represent rather 
complex relationships among the effects of different factors, may be of 
minor importance, often because previous experiments have shown that 
these interrelationships, called interactions, exert little or no influence. 
It has been found that, by a deliberate sacrifice of precision in the 
estimates of these interactions, the size of the group or block can be re- 
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duced so as to produce an increase in the precision with which the average 
effects of the factors are estimated. This principle, known as con- 
founding, can also be combined with the use of the latin square (chapters 
6 and 8). 

If all comparisons between pairs of treatments are potentially of equal 
importance, a different method is used in forming the blocks. When 
treatments are arranged in incomplete blocks, two treatments which oc- 
cur in the same block are more precisely compared than two which are 
placed in different blocks, at least in so far as the investigator has suc- 
ceeded in decreasing the variation within blocks. Therefore the goal is to 
construct a design such that any pair of treatments occurs equally often 
within some block. This type of problem has been investigated pre- 
viously for its mathematical interest. A solution ean be found for any 
number of treatments and any size of block, but most of the solutions 
require too many replications for the usual conditions of experimentation. 
The solutions available to date (called balanced incomplete block designs 
(chapter 11)) are given for all cases in which the number of replications 
does not exceed ten. Here again the latin square principle can some- 
times be used (chapter 13) to eliminate variation amongst two different 
types of grouping of the experimental units. 

For a given number of treatments and a given size of incomplete 
block, balanced designs allow little choice in the number of replications. 
Thus, with 64 treatments and blocks of 8 units each, 9 replications are 
required to balance the design. In order to provide designs for small 
numbers of replications, a number of additional types have been de- 
veloped; these are similar to balanced designs except that they lack 
complete symmetry. 

One group, called lattice designs (chapter 10), has been used widely 
in agricultural experiments during the past 5 years. For these designs, 
the number of treatments must be an exact square, €.g., 25 or 49, while 
the number of units in the block is the corresponding square root, 5 or 
7, respectively. A further group, known as cubic lattices (section 10.4), 
is useful when the number of treatments exceeds 100 yet it is desirable 
to have a small block. In this case the number of treatments is the cube 
of the number of units per block, and the number of replications may be 


any multiple of 3. 


2.38 Summary. There are numerous methods available for increasing 
the accuracy of an experiment. Frequently the same end may be 
reached in several different ways. Thus, in the experiment which tests 
the effects of a number of diets on the weights of children, the disturbing 
effects of variations in their initial weights can be reduced by selecting 
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for the experiment a number of children of approximately the same initial 
weight, or by the use of the analysis of covariance, or by a planned 
grouping in which children within the same replicate are of approxi- 
mately equal initial weight, or simply by having a large number of 
replications. The method adopted should be that for which the desired 
standard of accuracy can be attained with the smallest expenditure of 
time and effort. There is no special merit in either a complicated ex- 
perimental plan or a highly refined technique if equally accurate results 
can be secured with less effort in some other way. A good working rule 
is to use the simplest experimental design that meets the needs of the 
occasion. This is not to say that the more complex designs will be used 
only rarely; in fact, they have already demonstrated their utility in 
fields of research where no other equally practicable method appears to 
exist for reaching the same results. 


2.4 The Grouping of Experimental Units 


2.41 Investigation of Methods for Grouping. In order to take full 
advantage of the opportunities for increased precision by suitable group- 
ing of the units, the investigator must know the best criteria for group- 
ing. Frequently these are suggested from previous observations on the 
experimental material or on disturbing factors that become evident in 
the course of an experiment. Where knowledge is less definite, as in a 
new line of research, there are two ways in which information may be 
accumulated. 

One is to devote an experiment specifically to this problem. In this 
ease it is usually advisable to subject all units to the same treatment, 
carrying out a so-called uniformity trial. The word uniformity refers to 
uniformity of treatment; the experimental material and technique should 
be representative of those which are used in actual experiments. The 
amount of material should be sufficient to enable the experimenter to 
superimpose on the results a hypothetical trial of the size which he cus- 
tomarily performs. It is worth while to record any auxiliary measure- 
ments which might predict the performance of the experimental units. 
From the results of these trials any proposed grouping of the units can 
be formed and the amount of variation within and among the groups 
can be calculated. If it is desired to compare different experimental 
plans, each may be superimposed on the results of the uniformity trial. 
By the method indicated in section 2.31, the investigator may estimate 
the relative numbers of replications that are needed to reach the same 
degree of precision with the various plans. 

Secondly, useful information can sometimes be secured without any 
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special trial or technique by examination of the results of actual experi- 
ments. With an experiment which is suitably designed, it is possible to 
estimate from an analysis of variance of the results what the error vari- 
ance would have been if any particular grouping had not been used. This 
estimate, being derived from the same data, is directly comparable with 
the error variance obtained by the use of the grouping and enables the 
investigator to evaluate the success of the grouping. Consequently, for 
experiments in a new line of research, the experimenter may try any 
method of grouping that might be effective, in the knowledge that its 
appropriateness will be revealed by the experiments themselves. In- 
structions for performing such comparisons, which are easily made when 
the analysis of variance has been completed, are given in the notes which 
accompany each type of design. 


2.42 Criteria for Grouping. Many factors have been used as the basis 
for grouping. In agricultural experimentation it has been repeatedly 
demonstrated that plots close together tend to be more similar in their 
yields than plots farther apart. The almost universal practice is to put 
neighboring plots in the same group, with the groups approximately 
square in shape whenever practicable. Contiguity frequently forms a 
good basis for grouping in greenhouse experiments also, where differences 
may exist along the bench in temperature, sunlight, air currents, or ac- 
cessibility for watering. Alternatively, in many plant experiments, . 
and more particularly in animal experiments, some characteristic of the 
plant or animal is much more useful, the physical location of the ex- 
perimental units throughout the course of the trial having a relatively 
minor influence on the results. Age, weight, vigor, sex, and genetic con- 
stitution are some of the factors most commonly used. The animal it- 
self may constitute the block in cases where the treatments can be applied 
in succession to the same animal without producing residual effects which 
obscure the results. 

A single grouping, as in the randomized block design, can be used to 
eliminate simultaneously variation from a number of different sources. 
For example, suppose that an experiment were planned to investigate 
whether some treatment applied to rats enabled the animals to with- 
stand subjection to a dose of a poisonous gas. The replication would 
contain two animals, a treated rat and an untreated rat to be included 
asa control, Given a sufficient stock of experimental animals, the pairs 
in any replication might be animals of the same litter (and consequently 
of the same age), of the same sex, and of approximately the same weight 
and vigor. If it were convenient to test two animals at the same time, 
the pair could be put into the gas chamber together, thus eliminating 
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also the effects of variations in the strength of the dose or in the time for 
which it was supplied. 

The freedom with which the size of block can be varied depends on 
the type of research. In field experiments the block can be laid down so 
as to accommodate any number of treatments, though, as previously 
stated, the standard error per plot may be expected to increase slowly as 
the size of block increases. When the use of designs with incomplete 
blocks is contemplated for field experiments, the principal question is to 
decide for what number of treatments the reduction in block size be- 
comes worth while. On the other hand, the number of leaves on a plant, 
or of animals of the same litter, usually varies within rather narrow 
limits. In such cases, where the most homogeneous grouping fixes the 
size of the block, incomplete block designs may be of special interest. 

The gains in efficiency obtainable by skillful construction of the 
blocks, when measured in terms of increased replication, are often large; 
examples can be cited in which the effective replication has been in- 
creased from two- to tenfold. Frequently, apart from the preliminary 
care and thought involved, an accurate design uses the same material 
and involves the same physical operations as the design which it replaced. 
In some of the more complex designs the computations required to es- 
timate the treatment averages and to perform tests of significance be- 
come rather laborious and involved. The experimenter’s reaction to 
these difficulties will depend on his aptitude for numerical computations. 
At a first trial considerable time may be spent in mastering some of the 
methods, but the processes become easier with familiarity. 
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CHAPTER 3 


NOTES ON THE STATISTICAL ANALYSIS OF THE RESULTS 


3.1 Introduction 


With regard to the statistical analysis for a design, the procedure 
adopted in this book is to present the computations, with worked nu- 
merical examples for the designs likely to be most frequently used. To 
supply a complete mathematical justification of the analysis for each 
type of experiment, although in many respects desirable, would have 
unduly increased the amount of material. The same general technique 
governs the analysis of all designs, differences arising only in the adapta- 
tion of the technique to the particular structure of a design. In the 
present chapter this technique—the method of least squares—is de- 
scribed and applied to one of the simplest designs. With perseverance 
the method may be used to verify the computing instructions for other 
designs in this book. It should be pointed out that the computing meth- 
ods that are easiest in practice are not necessarily those that flow from a 
straightforward application of the theory, so that at first sight some of 
the instructions may appear different from those given by theory. 

Some account will also be given of certain statistical techniques, such 
as the technique for missing values and the analysis of covariance, that 
are particularly useful in handling the results of experiments, though 
they are perhaps too specialized to be included as part of a general in- 


troductory course in statistics. 


3.2 The General Method of Analysis 


3.21 The Mathematical Model. The illustrative data come from an 
experiment conducted in 1935 by the Rothamsted Experimental Station 
(3.1). The object of the experiment was to measure the effectiveness of 
4 soil fumigants in keeping down the numbers of eelworms in the soil. 
The fumigants were chlorodinitrobenzene (CN), carbon disulphide jelly 
(C8), and two proprietary preparations, “Cymag” (CM) and “Seekay”’ 
(CK). Each fumigant was tested both in a single and a double dose. 
These comprize 8 distinct treatments. The control (no fumigant) was 
39 
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the ninth treatment. Normally the replication would have consisted 
of 9 plots, but in this case 4 control plots were placed in each “replicate,” 
which actually contained 12 plots. The purpose was to supply a fairly 
accurate standard against which to measure the performance of the 
fumigants. 

The experiment was laid out in 4 separate blocks of 12 plots each, so 
that there were 4 replications for each dose of each fumigant and 16 
replications of the control. Within a block the 12 “treatments” were 
allotted to individual plots at random. The experiment is of the 
familiar type known in agriculture as randomized blocks (see chapter 4). 
The fumigants were ploughed in during spring, after which a crop of oats 
was sown. After harvest, a sample of 400 grams of soil was taken from 
each plot and the number of eelworm cysts counted for each sample, 
The data are those labelled ‘second count” in table 3.1. 


TABLE 3.1 PLAN AND NUMBERS OF EPLWORM cysts PER 400 GRAMS OF 
SOIL, FIRST COUNT ABOVE, SECOND BELOW 


2CM | 2CS | 2CK 0 
95 | 127 80 | 134 
199 | 166 | 142 | 590 
Block 1 Block 2 
Totals 10K | 1cN]icM] 0 | Totals 
2587 1o7 | 89} 41| 74 964 
4383, 236 | 332 176 | 137 3075 
0 0 | 2CN| 1CS 
88 25 42-| 62 
356 | 212 | 308 | 221 
20K} 0 | 1CK | 1CM 
193 | 209 | 109 | 153 
292 | 352 | 1382 | 454 
Block 3 0 | 2CN} 2CS| 0 Block 4 
Totals 29 9 17 19 | Totals 
1743 254 | 92 | 28 | 106 872 
4752 2470 
1CS | 1CN}| 0 | 2CM 
23, 19 Ad 48 
80 | 114 | 268 | 298 


In a formal analysis of the results the first step is to set up an equa- 
tion for every observation. This equation expresses the observation as 
the sum of four components: (i) a general average about which the 
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observations are presumed to be fluctuating; (ii) a component repre- 
senting the effect of the treatment applied; (iii) a component represent- 
ing certain environmental effects which the design of the experiment 
enables us to isolate; and (iv) a residual component, representing all 
other sources that influence the observation, and generally referred to as 
the “experimental error.” 

Component (iii) cannot be fully appreciated without some knowledge 
of the technique employed in the construction of designs. This technique 
consists in arranging the conduct of the experiment so that, in the sub- 
sequent analysis of the results, differences among certain groups of ob- 
servations can be measured. What is more important, the effects of such. 
differences can be eliminated from the estimates of the treatment effects. 
If this elimination were not feasible, these differences would contribute 
to the “experimental error,” component (iv), with the consequence that 
less accurate estimates of the treatment effects would be obtained. In 
the present example, as will be seen, these differences are differences 
among the groups of 12 observations that constitute a block. With 
each design that we describe, the environmental components that are 
eliminated in this way are pointed out. 

We proceed to write down the mathematical model as applied to the 
example. Instead of numbering the observations from 1 to 48, it is con- 
venient to use a subscript 7 to denote the treatment applied, j to denote 
the block in which the observation lies, and /: to denote the order within 
the block. For the plots that receive fumigants, the subscripts 7 and j 
are sufficient to define each plot uniquely. ‘The subscript & is needed to 
distinguish between different control plots within the same block. In 
this notation the equation for any observation may be written 


Yije = B+ Ti + Bi 1 ise 1) ¥ 
where p represents the general mean, 7; the effect of the treatment, 8; 
that of the block, and e;;x the experimental error. 


3.22 Assumptions Made in the Model. There is already implicit in 
the model one assumption that may not be true in the data obtained 
from experiments. This assumption is that the treatment effect and the 
block effect are additive. If we take the difference between two observa- 
tions in the same block, say Ysjx, 20d Yozk» We have, as a deduction from 
the model, 

Yajey — Yoike = TL — 721 Crjtr — C2ike (8.2) 
The block effects have disappeared from the right-hand side; that is, 
the model implies that the difference between the true effects of two 
treatments is the same in all blocks. On a block with a high eelworm 
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infestation a successful fumigant is presumed to reduce the eelworm 
numbers by the same amount as on a block with a low eelworm infesta- 
tion. This is unlikely to be so, except as a first approximation. Little 
systematic study of the accuracy of this approximation in practice has 
been made. From general experience it appears that the approximation 
works well in a large number of experiments; nevertheless, cases are not 
infrequent where the approximation is poor. 

In addition, several assumptions are made about the residual effects 
eije- These are taken to be independent from observation to observation, 
and to be distributed with zero mean and the same variance o”. Further, 
for tests of significance and the estimation of confidence limits, the ¢;;x 
are assumed to follow the normal or Gaussian frequency distribution. 
In practice, these assumptions are only approximately fulfilled. The 
consequences of errors in the assumptions will be indicated later (section. 
3.9). 

Apart from changes in detail, the type of model and assumptions 
presented for this example apply to all experimental arrangements in- 
cluded in this book, except for two types of design for which a more 
complex model is required. These are the split-plot designs (chapter it) 
where two or more different error variances are postulated, and in- 
complete block designs (chapter 9), where additional assumptions are made 
about the nature of the environmental effects A;. 


3.23 Estimation of the Treatment Effects. When the assumptions in 
the previous section are valid, there is a well-known result in statistical 
theory that the best estimates of the unknowns p, 7;, and 6; are obtained 
by the method of least squares. This method chooses estimates m, ¢;, and 
bj, respectively, which minimize the sum of squares of the residuals 

D Win — m — ts — b,)? (3.3) 
taken over all observations. 

Tn order to find these minimizing values we differentiate the sum of 
squares with respect to each unknown in turn, and set the derivative 
equal to zero. Consider a specific unknown, e.g., t;. If any square con- 
tains ¢;, its derivative is 

—2(ysie — m — ti — bj) 
If the square does not contain ¢,, the derivative is zero. Hence, if the 
derivative of the sum of squares in (3.3) is set equal to zero, the resulting 
equation may be written 


Dd Wik — m — t; — bj) =0 


YS m+ +b) =D yin (3.4) 


or 
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where the sum is over all observations whose equations contain #;, or 
more generally over all observations whose equations contain the param- 
eter to be estimated. 

Equations (3.4) are called the equations of estimation, or the normal 
equations. As we have seen, the equation of estimation for any param- 
eter is constructed by equating the total of the observed values y;j1, to 
the total of the expected values (m + ¢; + b;) over all observations whose 
equation contains the parameter. - 

We now write down some normal equations for the example. Since 
m oceurs in all 48 observations, its normal equation is 


48m + A(t, + te +++-+ ts) + 16f + 12(1 + be +-+++ bs) =G@ (3.5) 


where ty denotes the control treatment, and G the grand total of all 48 
observations. At this point a simplification may be introduced. In 
the model the quantities ¢; are needed only to indicate by how much an 
individual treatment differs from the average of all treatments in the 
experiment. Accordingly the model is not changed in any essential 
feature if we assume that the mean of all the ¢; is zero. It may also be 
shown that any convenient linear function of the ¢; may be assumed to 
be zero. The same remarks apply to the block values bj. From the form 
of equation (3.5) it appears that convenient assumptions of this kind are 


htb+-:-t+ts + 4 =0 (3.6) 
by + bo + bz + bs = 0 (3.7) 


With these assumptions equation (3.5) reduces to 48m = G, so that 
m becomes the mean value of all observations in the experiment. 

By summing over the observations that receive a particular treat- 
ment ¢; we find that its normal equation is 


dm + 4; + by $e tbs tho = 7, @= 1,2, +++, 8) 
and for the control, 
16m + 16% + 4(br + bo + bs + bs) = To 
where the 7’s denote observed treatment totals. From relation (3.7), 
these equations reduce to 


T; F T9 
efits Gai 28) Sag ol” (3.8) 


As might be expected, the effect of any treatment is estimated by 
taking the difference between the mean of the observations which re- 
ceive that treatment and the general mean. 
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Let us express these estimates in terms of the original model (3.1). 
By adding the relevant equations we find 


1; = 4p + 47; + (61 + Bo + Bs + Ba) + DL esse 
Ty = 16u + 1679 + 4061 + Be + Bs + Ba) + DL Coie 


Hence, using (3.8), 
m+t=ntu tt + Bo + Bs + Ba) + 2 


where é; is the average of the residual errors on plots that receive treat- « 
ment i. For the difference between a pair of treatments we deduce 


t — ty = (2 — tu) + G — Gi) (3.9) 


This shows that the estimated difference equals the true difference plus 
an error of estimate (2; — é,). Note that the error of estimate does-not 
contain any of the environmental (block) effects arising from compo- 
nent (iii). This result exemplifies a valuable property of least squares 
solutions, namely, that the estimate of the difierence between the ef- 
fects of any two treatments is not influenced by the environmental ef- 
fects, but only by the residual effects ¢;;x. It is in this sense that: we say 
that environmental effects are eliminated from the estimates of the treat- 
ment effects. 

Estimates of the block or environmental effects are usually of less 
interest. It may be verified that the least squares estimate of 6; is 

b; = eu —™m 
12 


where B; is the observed block total. 


3.24 Tests of Significance and Confidence Limits. When the estimates 
of the treatment effects have been obtained, the details of the subsequent 
study of them vary with the type of experiment. Usually, however, 
some of the following procedures form the basis for the final conclusions: 
(i) a test of significance of the null hypothesis that the effects 7; and 
7, of two treatments are identical, or, more generally, of the hypothesis 
that some linear combination of the 7; is zero; (ii) the construction of 
confidence limits for the difference (+; — 7.) between the effects of two 
treatments, or for a linear function of the treatment effects; (iii) a test 
of significance of the null hypothesis that a group of treatments 71, 
72, °**, Tp all have identical effects. In this section the theory govern- 
ing these procedures is presented briefly, without proof. The arith- 
metical methods are described in the next section. 


ee ae a) ee 
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i. Test of significance of the difference between two treatment effects. 
From equation (3.9) we have 


ty — ty = (r¢ — Tu) + G = Gu) (3.9) 


Since the e’s are normally distributed with zero means, it follows by 
theory that (t; — t,) is normally distributed about the true difference 
(rz; — tu). Further, the variance of (t; — t,) may be shown to be 


1 1 
. C + =) 
ee te 


where 7; and r, are the numbers of replications of ¢; and tu, respectively. 
Since it is useful to be able to calculate the variance of any comparison 
in which we are interested, the rules for doing this are given in section 
3.5. 

In order to proceed we need an estimate of the experimental error 
variance o2. From theory, the best estimate is known to be 


v= > 


where the residual sum of squares is taken over all observations. The 
divisor ne, called the number of degrees of freedom (d.f.) in the estimated 
error, is given by the rule 
ne = (Total number of observations) — (number of independent 
parameters that were estimated) 
The word independent is introduced because we nearly always include 
in the original model more parameters than are strictly necessary. In 
the example we found that we could assume one linear relation (3.6) 
among the é’s and one (3.7) among the b’s. Consequently the number of 
independent parameters is 
1 (for m) + 8 (for the ’s) + 3 (for the b’s) = 12 
Since there are 48 observations, e is 36. 
Hence, if the null hypothesis is true, i.e., Ti = Tu, then (t; — tu) is 
normally distributed with mean zero and estimated variance 


(yijx — m — ti — Dj)? 


Ne 


The ratio 


) 
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is known to follow Student’s tdistribution with n-, or 36 df. This is the 
quantity used for a test of significance of the null hypothesis. 

With a more complicated function of the estimated treatment effects, 
say L = (wit) + wel, +++-+ wl), Where the w’s are any set of num- 
bers, the procedure is essentially the same. The estimated variance of 
Lis s°f, where f is a numerical factor found by the rules given in section 
3.5, Fora test of the null hypothesis that the true value of L is zero, we 
use the ratio L/. sV if, which is distributed as Student’s ¢ with n, degrees 
of freedom. 

ii. The construction of confidence limits. The theory for a test of sig- 
nificance also leads to confidence limits for the unknown true difference 
(t; — tu). For, when 7; and 7, are not necessarily equal, the quantity 


(is = tu) — i = 7) 


may be shown to follow Student’s tdistribution with n, degrees of free- 
dom. Hence the confidence limits are given by 


1 1 
(7; — tu) = (ts — tu) = s(n, 2) .{— --— 
. r% Tu 


where d(ne, a) is the value of Student’s ¢ corresponding to me degrees of 
freedom and the chosen confidence probability a. A similar method is 
used for the more general function L. 

tit. A test of the identity of a group of treatment effects. Quite frequently 
the first test to be made is that of the hypothesis that all & treatments 
have produced identical effects, i.e., that all 7; are equal. In other cases 
we are more interested in the hypothesis that some subgroup of the treat- 
ments, say treatments 1 to p, has produced the same effects. We con- 
sider the latter test, since it reduces to the first test if p is put equal to k. 

This test is provided by a general theorem that has many applications. 
First, we find the least squares estimates of all treatment and environ- 
mental effects in the usual way, and compute the residual sum of squares 


82 = D0 Win — m — t — b,)? 


Next we start again and rewrite the mathematical model, inserting the 
restriction that 7; = 72 = -:: = 7». That is, wherever any of these 
7; appears in the equation for an observation, we replace the 7; by a 
common symbol, say 7’. The least squares estimates are computed for 
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this new set of equations, using the same observations. Again we com- 
pute the residual sum of squares which may be denoted by 


So? = DS (ye — mm! — Uf — 2; 
This will always be found to be at least as large as 5”. Finally, the 
theorem states that, if the null hypothesis is true, the quantity 


So? — 8:7 : Si? 
p-1l mM 


follows Snedecor’s F-distribution with (p — 1) and n, degrees of freedom. 

Although the procedure may appear rather complex, the test criterion 
used is a reasonable one. S,? is the sum of squares of deviations of the 
observations from the values predicted for them by the original model, 
so that it measures how closely the original model agrees with the data. 
So? plays the same role for the restricted model. Consequently, if So” is 
much larger than Sj, we are inclined to think that the restricted model 
does not fit the data nearly so well as the original model, and therefore 
to reject the null hypothesis that the restricted model is the correct one. 
However, the value of (S2” — 8,2) alone does not provide a measure of the 
improvement in the fit with the original model. If, for instance, this dif- 
ference is 10, then, other things being equal, we should regard the im- 
provement in fit as greater when S,? is 1 than when Sj? is 1000. It is this 
type of consideration that leads to the use of the ratio (S22 — S17)/S:? 
as the essential part of the test criterion. 

Although the theorem postulates two separate sets of least squares 
solutions, nearly all designs are constructed so that only one set of solu- 
tions must be found in practice. ‘A number of designs which could have 
been included in this book were omitted because the least squares 
analysis seems too cumbersome for frequent use. 


3.25 The Analysis of Variance. All the procedures (t-tests, F'-tests, 
and construction of confidence limits) use the residual sum of squares, 
which will often be called the error sum of squares. This quantity could 
be found by calculating for each observation yijx the value (m + ti + by) 
predicted by the least squares solution.. The sum of the squares of the 
differences between observed and predicted values could then be ob- 
tained. This method is slow, and the error sum of squares is much more 
quickly computed by a technique known as the analysis of variance. 

Tn the original model, each observation is represented as the sum of 
four components due respectively to the general mean, the effect of the 
treatment, the environmental effect, and the residual effect. In the same 
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way the analysis of variance partitions the sum of squares of the ob- 
servations into four sums of squares, one attributable to the general 
mean, one to differences between the estimated effects of the treatments, 
one to the environmental effects which the experiment is capable of 
measuring, and lastly one which is the residual or error sum of squares. 
Tn most cases we compute the original sum of squares and the first three 
components, obtaining the error sum of squares by subtraction. 

The analysis of variance provides much more than a short-cut method 
of securing the error sum of squares. The sum of squares due to treat- 
ments is the quantity (S_” — S,”) needed for the /’-test of the hypothesis 
that no differences exist between the effects of the treatments. By a 
slight extension the analysis also supplies the sum of squares required 
for testing the equality of the effects of a subgroup of the treatments. 
The component due to environmental effects enables us to estimate by 
how much the accuracy of the experiment has been increased by eliminat- 
ing these effects from the estimates of the treatment means. 

The analysis of variance depends on a number of algebraic relations 
which will be illustrated for the eelworm experiment. From section 3.23 
it will be recalled that the least squares normal equation for any un- 
known, say ¢;, was 


DL (sje — m — t; —B)) = 0 
summed over all observations which received the ith treatment. If we 
multiply this equation by ¢; and add the equations for different treat- 
ments together, we obtain the equation 

> tyr — m — t — b) = 0 (8.10) 
where the sum is now over all observations, since every observation is 
associated with one and only one {;. Similarly we may establish the re- 
lations 

Xi myn — m — t; — bj) = 0 (8.11) 

DX yin, — m — t; —8;) = 0 (8.12) 
where both sums extend over all observations. 


A few more relations of this type are required. If we add the estimates 
t; over all plots in the jth block, the sum is 


(i + to + ty +-+++ tg) + 4ty 


But this is zero because of equation (3.6) in which a linear relation among 
the #’s was introduced. Hence, if we multiply by b; and add over all 
blocks, we establish the relation 


¥Y bt; = 0 (3.18) 
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where the sum is again over all observations. By similar arguments we 
prove the additional relations 


mb; =0; Domi =0 (3.14) 


These relations lead to the partition of the sum of squares of the ob- 
servations. Write 


Yair = m+ ty + d; + (yaze — m — ty — Dj) 
Square both sides, and add over all observations. The six relations 
(3.10) to (3.14) show that all six sums arising from cross-product terms 


on the right-hand side add to zero. Consequently, we have the following 
analysis of sums of squares. 


YS yi = De + Dt + LVF + D Gi —m — bh — ¥)? (B16) 


This equation is the basis of the analysis of variance of the results. 


3.26 Application to the Example. To obtain the left-hand side of 
(3.15) in practice, we simply compute the sum of the squares of all ob- 
servations. For the eelworm data this will be found to be 5,481,198. 
The components due to the mean, to treatments, and to blocks are not 
usually calculated from (3.15) as it stands, because it is quicker and more 
accurate to obtain them from totals rather than from the estimated ef- 
fects. 

Thus, for the mean, the contribution is 48m”, since there are 48 ob- 
servations. If Gis the grand total of all observations, 14,680, this con- 
tribution may be written G?/48, or (14,680)?/48, which amounts to 
4,489,633. This term is sometimes ealled the correction for the mean. 

Written in full, the sum of squares for treatments is 


A(t? +t? +++ ++ ts") + 1660" (3.16) 
But from (3.8), 


baton (@j =1, 2, +--+, 8); bata m 
where 77; is the observed treatment total. By substitution, (3.16) be- 
comes 
A(T, — 4m)? + (Lo — 4m)? +++++ (Lg — 4m)?] + Ze (To — 16m)? 
When each parenthesis is expanded, we have 


2 Be tages Ere Serif 
eae FTE 4 TE = Bly + Ta ++ To + Te) + in? 
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Now the sum of the observed treatment totals 7; is the grand total G, 
or 48m. Consequently the last two terms above may be amalgamated to 
give 
7. 2 7. 2 eels T, 2 T 2 
sees 2 re a 48m? (3.17) 


Note that the last term is the correction for the mean, already found to 
be 4,489,633. Expression (3.17) is an example of the more general ex- 


pression 
k T? 
ee 
ial T% 
which gives the treatments sum of squares when the ith treatment is 
replicated 7; times, C being the correction for the mean. 
To obtain the numerical value we first calculate the treatment totals 
as shown in table 3.2. 


TABLE 3.2 TREATMENT TOTALS FROM TABLE 3.1 


Level of 
application CN Cs CM CK Totals 
0 5858, 5858 
1 1066 928 1431 892 4317 
2 1265 877 1241 1122 4505 


The treatments sum of squares is given by 


1066)? + (1265)? +--+--+ (1122)? (5858)? 
aoe - ee 2 ~ — 4,489,633 = 157,448 


The blocks sum of squares, >) b;”, can likewise be expressed in terms 
of block totals as follows. 


By? + Bo? + Bs? + Be 


Cc 
12 
4383)? + (3075) + (4752)? + (2470)? 
is Se a _ eR 3 4,489,633 = 289,427 


where the observed block totals have been inserted from table 3.1. The 
divisor 12 is the number of observations per block. 

The complete analysis of variance is shown in table 3.3. It is not 
customary to display the correction term for the mean, since this is of no 
particular interest. The item labelled “‘total” at the foot of the table is 
the original sum of squares, minus the correction for the mean (5,481,198 
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— 4,489,633 = 991,565). This may be shown to be equal to the sum of 
squares of deviations of the observations from their mean; i.e., 


DS yen? — 48m? = Yo (yee — m)? 


TABLE 3.3 ANALYSIS OF VARIANCE OF THE DATA IN TABLE 3.1 


Degrees of Sum of Mean 

freedom squares square 

Source of variation (d-f.) (s.s.) (m.s.) 

Treatments 8 157,448 19,681 

Blocks 3 289,427 96,476 

Error (by subtraction) 36 544,690 15,130 
Total 47 991,565 


The column which has been computed above is, of course, the column 
labelled “sum of squares.” The other two columns remain to be ex- 
plained. The “degrees of freedom” associated with any component are 
the number of independent parameters required to describe that com- 
ponent in the model. In the case of treatments, this always equals one 
less than the number of treatments, and similarly for blocks. For the 
total s.s., the number is the number of observations (48) less one repre- 
senting the contribution of the mean. 

The degrees of freedom have two principal uses. First, by subtraction, 
they give the degrees of freedom for error (36). As mentioned in section 
3.24, this is the divisor needed for the error s.s. in order to estimate the 
error variance 0”. ‘The estimate is 544,690/36, or 15,130, which is called 
the error mean square and is shown in the right-hand column of table 3.3. 
Second, the degrees of freedom for treatments are used in an F-test of 
the hypothesis that all & treatments produced the same effects. In 


section 3.24 the value of F for testing this hypothesis was given as 


The denominator Si2/ne is the error m.s. In the numerator the quan- 
tity (S.? — Si?) may be shown to be the treatments s.s., while (% — 1) 
is the number of degrees of freedom associated with treatments. Thus 
the numerator is 157,448/8, or 19,681, called the treatments mean square. 
The F-ratio for this experiment is 19,681 /15,130, or 1.30, with 8 and 36 
df. 

In the same way the mean square for blocks forms the numerator for 
the F-test of the hypothesis that all block effects are identical. Although 
this test is rather seldom of interest, the mean square for blocks has other 
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uses. By formulae which will be given with the individual designs, this 
mean square leads to an estimate of the error variance that would have 
been obtained if the experiment had not been grouped into blocks. 

With some designs the structure of the analysis of variance is more 
complicated than in the example above. What happens is that the 
treatment and environmental contributions to the sum of squares are 
entangled. If we calculate the treatments s.s., a different result is ob- 
tained according to whether we assume that block effects are present or 
absent, and vice versa for blocks. In such cases the usual procedure is 
to present the blocks s.s. as calculated when treatment effects are ig- 
nored, and the treatments s.s. as calculated when block effects are taken 
into account. These items are sometimes called “blocks, ignoring treat- 
ments” and “treatments, eliminating blocks,” respectively. If calcula- 
tions are made in this way, the two terms may still be added so as to 
obtain the error s.s. by subtraction; further, the treatments s.s. is still 
the appropriate one for an F-test of the treatment effects. 

This presentation of the analysis of variance is necessarily very in- 
adequate. For further reading, reference should be made to Fisher 
(8.2, 3.8) and Snedecor (3.4), whose discussions involve little math- 
ematics. For presentation of points of theory, see Kendall’s book (3.5) 
and papers by Fisher (3.6, 3.7) and Yates (3.8). Unfortunately, no single 
reference contains a really comprehensive account of the subject. 


3.3 Accuracy in Computations 


3.31 Original Records. If there exists a series of rules which will 
guarantee that all computations are accurately done, we do not know 
them. The following notes may help in maintaining a high standard of 
accuracy. 

The first place where errors may occur is in the original records. These 
should be made in a clearly legible and permanent form. Where scale 
readings are being taken, the habit of checking each reading immediately 
after it has been written down is a useful one. As the observations are 
being recorded, any that appear anomalous should be examined and the 
reason sought. Gross errors are often eliminated in this way. If on 
checking such an observation no error or explanation is found, a note to 
this effect should be made. Such notes are helpful in situations where 
the later analysis is done by a different person. Most statisticians who 
analyze other people’s experiments have encountered the perplexing 
problem where an observation is so completely out of line that a gross 
error is suspected, yet the original records contain no comment about 
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the observation, and the recorder’s memory of the circumstances in 
which it was taken has faded. 

Copying from the original records to a form suitable for analysis may 
also introduce errors. Since cross-checking is rather a soporific task, mis- 
takes tend to be overlooked even when two persons are used for the 
check. Sometimes foresight can eliminate the need for copying. 


3.32 Checks in the Analysis of Variance. The analysis of variance it- 
self consists of two sorts of operations. First, a number of totals are 
formed. These are nearly all self-checking, for the fact that the treat- 
ment totals and the block totals both add to the grand total may be ac- 
cepted as proof of their correctness. This is not a watertight check, since 
compensating errors in two treatment totals pass undetected. Also it 
does not ensure that treatments and blocks are correctly identified, and 
cases have occurred where treatment totals were mistakenly labelled as 
block totals, and vice versa. But apart from such rare events the sum 
check is as satisfactory as any. 

The second stage, the computation of the various sums of squares, is 
not self-checking and must be checked by recomputation. The most ac- 
curate method is to have two computers carry out the analysis inde- 
pendently, comparing results only when both have finished the task. 
The practice of having one computer “check” the results of the other, 
though more convenient, always seems to produce a certain number of 
errors. This is to be expected, because, when the first calculations are 
nearly always correct, as they should be, it is difficult for the checker to 
avoid the presupposition that they are correct, with resulting mental 
laziness. Recomputation by the same person, though often unavoidable, 
is probably less accurate than the two preceding methods. 

‘An expert who wishes to do so can make the analysis self-checking. 
For instance, the error s.s. can be computed independently of the other 
sums of squares, sometimes quite easily and sometimes with more diffi- 
culty. If this sum of squares agrees with that found by subtraction, all 
the items in the “sums of squares” column may be regarded as correct, 
apart possibly from errors in labelling. However, with the computa- 
tional methods that we present, checking is required except in a few cases 
specifically noted. 

The steps in the preparation of summary tables (¢.g., calculation of 
treatment means from totals) are usually partly but not entirely self- 
checking. For example, the fact that treatment means average to the 
general mean forms a check. Care should be taken to recompute those 


steps that are not self-checked. 


54 NOTES ON THE STATISTICAL ANALYSIS OF THE RESULTS 3.32 


With the more complex designs, where treatment means have to be 
adjusted in order to eliminate the environmental effects, errors have 
arisen because the computing instructions were misunderstood. For 
instance, the adjustments made were ten times as large as they should 
have been. A person thoroughly familiar with the design would have 
known that the adjustments were ridiculously large, but someone new 
to the design may have no idea what size of adjustment it is reasonable 
to expect. This danger with complex designs should not be ignored. It 
can be minimized by careful study of the method of analysis for any 
design that it is proposed to use. 


3.33 Number of Figures to be Retained. Since the primary purpose 
of the analysis of variance is to obtain an accurate value for the error 
m.s., the number of figures which it is worth retaining during the cal- 
culations depends on the size of this figure. This is not known in ad- 
vance, but often a rough idea of the coefficient of variation (ratio of the 
standard error to the mean of the experiment) is available from previous 
experience. For the original data from which the analysis of variance is 
computed, a crude rule which errs on the safe side is as follows. Record 
the original data to 4 significant figures if the coefficient of variation is 
between 0.4 and 4%, to 3 if it is between 4 and 40%, and to 2 if it exceeds 
40%. Let us apply this rule to the eelworm data. Since experiments on 
eelworms are not very common, there is not much basis on which to 
predict the coefficient of variation. Suppose that we guess that it is un- 
likely to be below 20% (it actually turned out to be 40%). Then, for 
safety, data should be recorded to 3 significant figures. Since the average 
eelworm number per plot is around 300, this means that data are re- 
corded to the nearest eelworm, which is as accurately as they could be 
recorded. 

A more precise rule, requiring a little calculation, is that the rounding 
interval should not exceed one-quarter of the standard error per observa- 
tion. To apply this rule to the eelworm data, starting with the same 
premise that the coefficient of variation is 20%, we note that the mean 
per plot in the experiment is about 300. Thus the predicted value of the 
standard error per observation is about 60, so that the rounding interval 
should not exceed 15. It will be quite satisfactory to round each ob- 
servation to the nearest 10 eelworms. The discrepancy from the result 
given by the previous rule illustrates the fact that the first rule tends to 
be too conservative. 

The advisability of deliberately rounding records already taken should 
be left to the judgment, of the experimenter. With modern calculating 
machines it may be more expeditious to carry out the computations with 
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a few unnecessary figures than to take time to perform and check the 
rounding. The rules are most likely to be useful where computing de- 
vices are poor or unavailable. 

In the analysis of variance itself, it is generally as well to carry the full 
number of figures obtained from the uncorrected sum of squares; e.g., if 
the original data contain one decimal place, the sum of squares and hence 
the analysis of variance will contain two decimal places. This number 
will usually be excessive, but no more time is likely to be wasted in 
writing down the unnecessary digits than in estimating how many digits 
should be retained. 

In the final presentation of results, on the other hand, superfluous 
digits should be strictly avoided. They make the conclusions more 
difficult, to grasp, impede rapid mental comparisons, and give some 
readers an erroneous impression of the accuracy of the results. A treat- 
ment mean should be rounded to one-tenth of its estimated standard 
error; that is, if the estimated standard error of the treatment means is 
2.56, the means could be rounded to 0.256 and should be rounded to 0.1, 
i.e., to one decimal place, since this is the nearest rounding interval that 
is convenient. In some experiments (e.g., those of the split-plot type) 
different treatment comparisons have different standard errors. Here 
the rounding interval should be decided from those comparisons that 
have the smallest error. 


3.34 Identification of Data. The practice should be followed of label- 
ling all data, from original records to summary tables, so that; another 
person at a later time can tell what the data mean. Much data, ex- 
pensive to collect and potentially valuable for some research, have had 
to be discarded or destroyed because the original collector cannot be 
reached, and new investigators are unable to discover what measure- 
ments were taken and under what circumstances. Sometimes the units 
in which the observations or summary tables are presented cannot be 
found, or the crop on which a field experiment was carried out, or the 
nature of the experimental treatments. Frequently, data serve their 
most fruitful purpose when some later investigator gathers from many 
places all the material bearing on some question, and it should be a habit 
to facilitate rather than hinder such research. 


3.4 Subdivision of the Sum of Squares for Treatments 


3.41 Reasons for Subdivision. Often an experiment is planned to 
provide the answers to a number of different questions, not necessarily 
connected with one another. The eelworm experiment is a partial 
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though not an ideal example. One way in which the questions asked 
might be framed is as follows. (i) Is soil fumigation effective? Here we 
are interested in the average reduction in eelworm numbers taken over 
all fumigants. (ii) Are there differences in the effectiveness of different 
fumigants? For this question, comparisons of the reductions due to dif- 
ferent fumigants are relevant. (iii) What is the relative effectiveness of 
single and double doses? Since the simplest reasonable hypothesis is 
that the reduction in eelworm numbers to the double dose is twice that 
to the single dose, we might narrow this question by asking: Is the aver- 
age response to fumigation proportional to the amount of dressing, or in 
other words is the average response curve linear? (iv) Finally, if there is 
some indication of curvature in the responses, we might ask whether the 
amount of curvature is the same for all fumigants. 

With experiments of this type the F-test of the complete treatments 
m.s. is not particularly helpful because it is directed, as it were, at a 
mixture of several diverse questions. By an extension of the analysis of 
variance, we can subdivide the treatments s.s. into a number of com- 
ponents that are more relevant to the individual questions. Moreover, 
an F-test. can be made on the mean square for each component. The 
rules for subdivision are given in succeeding sections. Hffective use of 
the device requires long practice applied to a considerable number of ex- 
periments. 


3.42 Subdivision into Single Components. The simplest case, where 
all treatments have the same number r of replications, is considered first. 
The calculations are best made from the treatment totals 7';. Examples 
of the sort of quantity that we are interested in studying are 


T, + T: 
Ty — 73; Soren eae Ti Ty = (Lp + T,) 


Quantities of this type are called linear functions of the 7’s. Note that 
the sum of the coefficients of the T's is always zero, as it must be if the 
quantity is to represent a comparison among the 7s. These ideas may 
be formalized as follows. 

Definition. Any linear function 


2w = bwiT1 + boos e+ ++ lox 
is called a comparison among the T’s if 


bor + we +++++ lon = 0 
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Rule 1. If 2» is any comparison among the T’s, the quantity 


2 
Fe" where Dy = r(lot? + lus? ++++-+ lu?) 
Dw 
is a component of the sum of squares for treatments and represents 1 d.f. 
This rule gives the divisor D,, to be used so that the comparison zy 
may be included as a part of the treatments s.s. in the analysis of vari- 
ance. Incidentally, the rule implies that 2?/D » cannot exceed the treat- 
ments s.s. If it is found to do so, this is a sure sign of a mistake in com- 
putation. 
Definition. Two comparisons 2; and 22 are said to be orthogonal if 


hiiler + hiolog +++-+ lirlor = 0 


This “sum of products” relation is very important. By theory, it 
ensures that 2; and 2 are distributed independently of each other. Its 
relevance to the partition of the treatments s.s. appears in rule 2. 

Rule 2. If 2, and zp are orthogonal, then z:”/D» is a component of 

27 

Treatments s.s. — — 

1 
This means that, if we divide the treatments s.s. into the contribution 
from a comparison z; and the remainder, and now wish to subdivide the 
remainder, we must choose comparisons that are orthogonal to z;. Sim- 
ilarly, after removing the contribution of za, the next comparison 23 must 
be orthogonal to both 2; and za, and so on. This leads to the next rule. 

Rule 3. If the comparisons 21, 22, «++, 2,—1 are mutually orthogonal, 
i.e., every pair is orthogonal, then 

2 2 2 


Treatments s.s. eee a 
et pea Dsahe ay) Dyin 


®k-1 


This algebraic identity partitions the treatmentss.s. [which has (k — 1) 
degrees of freedom] into (& — 1) components, each representing a single 
degree of freedom. ‘ 

Starting with a specified 2;, we can always find 2g, 2g, +--+, 2:1 80 as to 
construct a complete orthogonal set. In fact, it may be proved that there 
is considerable freedom of choice in this process. When all but the final 
z;-1 have been selected, there is only one possible choice for 2,1, but at 
any previous stage many 2’s can be found that are orthogonal to all pre- 
ceding z's. The experimenter must choose the z’s that are most relevant 
for purposes of interpretation. 
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Table 3.4 shows the single components most frequently used for inter- 
pretative purposes when there are three or four treatments. It is as- 


TABLE 3.4 SaMPLe SETS OF SINGLE COMPONENTS 
Three treatments 


i, Equally spaced increments of one ii. Two qualities of an ingredient and 


ingredient, a control 
ca 2 rT 22, 

Ty =e 1 T, (0) —2 0 

T. 0 -2 Ts (a) 1 -1 

Ts 1 1 Ts (aa) 1 1 

Component Lin. Quad. Effect Quality 

divisor D 2r* 6r Component of a diff. 
divisor D Gr 2r 


Four treatments 


i. Equally spaced increments of one ii. Two comparable types of ingredient, 
aand b, and two qualities of each type 


ingredient 
2 2 2 2 2 23 
T -3 +1 -1 Ty (a) +1 +1 0 
T2 —1 —1 +3 T2 (a2) +1 -1 0 
T3 +1 =1 -3 Tz (bi) -1 0 “pl 
Ts +3 +1 +1 Ts (ba) =1 0 =I 
Component Lin. Quad. Cubie Quality Quality 
D 20r 4r 20r diff. diff. 
Component a vs.6 withina within 6 
D 4r 2r 2r 


iii. Different levels of each of two different ingredients a and b 


4 29 23 
Ty (aibi) -1 -1 +1 
T2 (abi) Fi -1 -1 
Ts (abe) =) +1 -1 
Ts (aabe) ct! ary! +1 


Average Average 
Tesponse response 
Component toa tob Interaction 
D 4r 4r 4r 


r* = number of replicates, assumed the same for all treatments. The coefficients 
given above are not valid if r varies from treatment to treatment. 


sumed that all treatments have the same number of replicates. The 
divisors D are those required for inserting the square of z in the analysis 
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ot variance. The reader should verify that these divisors satisfy rule 1, 
and that each set of z’s forms a complete orthogonal set. 

With three treatments the two sets presented are essentially the same, 
though the meaning attached to each z is different. Case i applies in an 
experiment where we have, say, zero, single, and double dressings of a 
fumigant, and wish to isolate the linear and quadratic components of the 
response curve. The response found by fitting a straight line to the three 
levels is represented by 21, while zg measures the deviation from a linear 
response. Case ii arises when we have comparable dressings of two dif- 
ferent fumigants, and wish to examine the average effect of fumigation 
(21) and the difference between the fumigants (ze). 

With four treatments, case i shows zero, single, double, and triple ap- 
plications of the same ingredient; here the response curve can be divided 
into its linear, quadratic, and cubic components. Alternatively, in case ° 
ii, we might be testing the wearing qualities of two types of “100% wool” 
suits and two types of suits made of wool-rayon mixtures. Case iii is 
used, for instance, when we examine the effects of two different amounts 
of sugar and two different amounts of vanilla in the preparation of a cake, 
Experiments of this type are discussed more fully in chapter 5. 

When different treatments have differing numbers of replicates, the 
rules are changed slightly. If the 7th treatment has 7; replicates, we have 
shown that the treatments s.s. is 

This Fa T.? ae T (T+ Te. +-+-+ 7)? 
ry r2 Tk on) 


The changes to be noted are: 
i. A linear function 


2w = bor + booTs +++++ bok T' 
is a comparison among the treatment totals 7’; if 
Tylor + Talwe +++ + ribwk = 0 
ii. The divisor required for zy” is 
Dy = Tibor? + relwo? +++ ++ telwk” 
iii. Two comparisons 2; and 22 are orthogonal if 


ryliiler + relates +++++ rebielor = 0 


3.43 Incomplete Subdivisions. The subdivision need not be com- 
plete in the sense that it is composed entirely of single components. In 
the eelworm experiment we suggested that the treatments s.s., with 8 
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components, might be divided into the following parts: (i) a single com- 
ponent representing the average response to fumigation; (ii) a single 
component representing the deviation of the average response from lin- 
earity; (iii) a part representing differences in the responses to the dif- 
ferent fumigants. Since there are 4 fumigants, this part comprizes 3 
components; (iv) a part representing differences in the curvature of the 
response curves for different fumigants: this also has 3 components. 
Some rules for partial subdivision will be given and applied to this ex- 
ample. The most general rule is as follows. 

Rule 4. A set of g quantities Q; are independent components of the 
treatments s.s., with n; degrees of freedom respectively, if 


ea”, zig” Zing” 
Se a ee Oe 


ae Dy Dig Din; 


where all z;; are mutually orthogonal comparisons, and the D;; are the 
appropriate divisors. Further, the remainder 


Treatments s.s. — >> Q; 
is an independent component of the treatments s.s. with 
* [{(& — 1) — >} ni] degrees of freedom 


This rule, which is a deduction from previous rules, is not very helpful 
to the beginner. With experience it often becomes easy, without carry- 
ing out the details, to see whether a set of Q; can be expressed in this 
way. Some more specialized rules follow. 

Rule 4a. If T;, T2, ---, Ty are the totals for any set of treatments, 
then 


gay Mo et tm 
" T re yn 


is a component of the treatments s.s., with (p — 1) degrees of freedom. 

This rule states that the sum of squares of deviations among any sub- 
group of the treatments, with proper divisors, is part of the treatments 
8.8. 

Rule 4b. Sometimes the treatments (or part of them) can be divided 
into a number of groups. The number of treatments in a group need not 
be constant, and different treatments may have different amounts of 
replication. Let 7;; be the total for the jth treatment in the ith group, 
and let 

Si = Ta + Tre +-°++ Tips 


Ry = ra + rig ++ +++ rip; @ = 1,2, +++) 
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Then let 
To? Te? Tig Se " 
Q = $e tS 1,2) 
Ti Ti2 Tip; Ri 
S,? S2  (S;-+:---+S,)? 
Qe41 Seen Se “a 
Ry R, Ry +:+++ Rg 


In this case Q; represents the sum of squares of deviations among the 
treatments in the ith group, with (p; — 1) degrees of freedom, while 
Q,41 is the sum of squares of deviations among the group totals with 
(g — 1) degrees of freedom. These Q’s are independent components of 
the treatments 5.8. 

Rule 4c. This applies when all treatments have the same number r of 

replicates, and every group contains p treatments. Let 
2 =hTa + ble +--+ blip 
where the set of U’s remains the same in all groups. Then 
(ei; — 2)? 

<2 Si 
is a component of the treatments s.s., with (g — 1) degrees of freedom. 
This rule is useful when we wish to compare linear functions of the treat- 
ments within each group. Note that the divisor in @ is the same as the 
divisor that would be used for a single 2;. 

Applications. In the eelworm experiment, the average effects of fumi- 
gation are obtained from a comparison of the total for the control with 
the totals for the single and double levels of fumigation. Consequently, 
we might regard the 9 treatments as divided into 3 groups: the control, 
the single levels, and the double levels. By rule 4b, this gives the follow- 
ing partition of the treatments s.s. (the treatment totals are found in 
table 3.2, p. 50). 


Between fumigants (single level): 


(8 df.) 
(1066)? + (928)? + (1431)? + (892)? (4317)° 
7 aie Crs 45,461 


Between fumigants (double level): 


(3 df.) 
(1265)? + (877)? + (1241)? + (1122)? (4505)? 
= = 23,641 
4 16 
5858)? 4317)? 4505)" 14,680)” 
Between levels: 6 pactad ie au ) ( 48 ) = 88,347 


(2d.f.) 
The three sums of squares add to 157,449, in agreement with the treat- 
ments s.s., 157,448, as given in table 3.3, p. 51. 
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The next step is to divide the sum of squares between levels into a 
component representing the average response to fumigation and one 
representing the deviation from a linear response. By least squares 
theory, the former is given by a comparison of the double level with the 
zero level. That is, we may take 

a = Se — So = 4505 — 5858 = —1353 


where the subscript refers to the level. The deviation from linearity is 
measured by the orthogonal comparison 


22 = (Sz — S81) — (Si — So) = S2 + So — 25; 
= 4505 + 5858 — 2(4317) = 1729 


Since each S is a total over 16 plots, the divisors are 32 and 96 re- 
spectively. Hence the contributions to the sum of squares are: 


; (1353) 
Average linear response: 2 = 57,207 
(1729)? 
Average curvature: 96 = 31,140 
D 


The subdivision found thus far might be presented as shown below. 


df. 8.8. ms. 


Between treatments 8 157,449 19,681 
Average linear response 1 57,207 57,207 
Average curvature ai 31,140 31,140 
Between single levels of fumigants 3 45,461 15,154 
Between double levels of fumigants 3 23,641 7,880 


This is not the subdivision envisaged at the beginning of this section. 
Instead, it was proposed to divide the final 6 d.f. into 3 representing dif- 
ferences in the responses to individual fumigants and 3 representing dif- 
ferences in the curvatures of the individual response curves. This 
separation is more difficult, 

Consider a comparison of (CN) and (CS). Estimates of the difference 
between these fumigants are available both at the single (ICN) and the 
double (2CN) level. If the effects of both fumigants are proportional 
to the amounts of dressing, the true difference at the double level will 
be twice that at the single level. On this assumption, two independent 
estimates of the difference between (CN) and (CS), for a single dressing, 
are 
(CN) — (2CS)] 


{(1CN) — (1CS)] and : 
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The second estimate is considerably more accurate than the first, since 
its variance is only 14 as large. Statistical theory shows that such 
estimates are combined by weighting each inversely as its variance. 
Hence, on the assumption of linearity, the most accurate estimate of 
the difference per unit dressing is 
[(1CN) — (1CS)] + 4)(@)[2CN) — 2CS)] 
1+4 
& [(QCN) — (1C8)] + 2[(2CN) — (2CS)] 
5 


(3.18) 


This shows that differences in the linear responses to the 4 fumigants 
are measured by comparisons of the quantities (ICN) + 2(2CN), ete. 
These quantities are given below. 


(Single level) + 2(double level) 
(CN) (CS) (CM) (CK) Total 
3596 2682 3913 3136 13,327 


It is for quantities of this kind that rule 4c is useful. This rule shows that 
the sum of squares of deviations of these quantities, when divided by 
r>. U7, or (4) (1)? + (2)"], i.c., 20, is a component of the treatments 8.8. 
This gives 43,408, representing differences in linear responses to the 
fumigants. 

We now consider the final three components. The curvature of an 
individual response curve is measured by a comparison of the type 


[(2CN) — (1CN)] — [(1CN) — (0)] = [2CN) + ) — 2(LCN)] 


where (0) represents the total number of eelworms over 4 plots for the 
control treatment. When the difference between two fumigants is taken, 
the (0) term disappears and we obtain the comparison 


[@ON) — 2(LCN)] — [(2CS) — 2(1C8)] (8.19) 
It is easy to verify that this is orthogonal with comparisons like (3.18). 
Consequently differences in curvature are compared by means of the 
quantities shown below. 
2(Single level) — (double level) 
(CN) (CS) (CM) (CK) Total 
867 979 1621 662 4129 


Rule 4c again applies. The divisor is 20, and the sum of squares con~ 
tributes 25,693. The final separation is given in table 3.5. It is evident. 
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from F-tests that none of the mean squares is significant at the 5% level, 
and only that for the average linear response approaches near to the 
5% level. These results must not be taken as final conclusions for this 
experiment. As will be seen in section 3.8, additional data were recorded 
in this experiment which permit a more accurate analysis. 


TABLE 3.5 Svuspivision OF THE TREATMENTS 8.8. FOR THE EELWORM DATA 


Source of variation df. 8.8. m.8. 
Average linear response 1 57,207 57,207 
Average curvature 1 31,140 31,140 
Differences in linear response 3 43,408 14,469 
Differences in curvature 3 25,693 8,564 
Error 36 544,690 15,130 


A more extensive discussion of this general topic, with a number of 
examples, is given in Snedecor’s book (3.4), chapter 15. It is worth re- 
peating that the amount and type of subdivision that should be done de- 
pend on the experiment, and to some extent on individual taste. As will 
be seen in the next section, any single component can be tested by means 
of a t-test derived from the treatment means, and some workers prefer to 
make the test in this way. 


8.5 Calculation of Standard Errors for Comparisons among 
Treatment Means 


3.51 Rules. We first give a rule (rule 5) that is more general than 
we need, because it is sometimes useful for other purposes. 
Rule 5, The standard error of any linear function 


2 = hy + lye +++++ lyynv 


of the individual observations is 


a, =o VU? + ly? +++ + ly? 


The estimated standard error is obtained by substituting s for ¢, where s 
is the square root of the error m.s. in the analysis of variance. This rule 
supplies the standard error of any kind of linear function, provided that 
it has been expressed in terms of the individual observations. For linear 
functions of the treatment means, one of two specisl cases of the rule is 
used. 

Rule 5a. If all treatments have the same number r of replications, the 
standard error of a linear function 


2 = hi + leo +++-+ lige 
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of the treatment means 7; is 
oe Vi te tte 
= Vr 


Rule 5b. If the ith treatment has 7; replications. the standard error of 


Z218 
i? i? ie 
o2=0,(—-+— ++ 
TI T2 Tk 


As in the general case, we substitute s for ¢ in order to obtain the esti- 
mated standard error. Finally, a rule that may save labor in complex 
cases is as follows. 

Rule 6. If the linear functions 21, 22, «++, Zp are mutually orthogonal, 


the standard error of 


2 = ey + loeo +++++ Lpty 


is 


Og = Vy204? + ly?o9? +--+ + 1,70? 


where o,” is the variance of 2;. 

This rule enables the work to be done in two stages. Sometimes, in a 
large experiment, it is relatively easy to express the function z in terms 
of a number of familiar orthogonal functions, but rather tedious to write 
z in terms of original observations. The variances of the z; may be found 
by rule 5 and substituted in rule 6. Note that different z; may involve 
the same set of observations, provided that the z; are orthogonal. 


3.52 Examples. These rules will now be illustrated by application to 
the celworm experiment. The treatment means are shown in table 3.6, 
the figures in parentheses denoting the order in which the treatments are 


numbered. 


TABLE 3.6 M®aN NUMBERS OF EELWORMS PER PLOT 


Fumigant 
Level (CN) (CS) (CM) (CK) Means 
0 366(9) 366 
1 266(5) 232(6) 358(7) 223(8) 270 
2 316(1) 219(2) 310(8) 280(4) 281 


From table 3.5 the standard error per plot is VV (15,180), or s = 123, 
with 36 d.f. 
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Example 1. Standard error of the average linear response. This is 
measured by the average response to the double dressing, ie., 


z=4(y1 + Go + Gs + Hs) — Yo = —85 
Since the first 4 means are based on 4 replicates, while yo has 16 repli- 
cates, we apply rule 5b, which gives 


0 (GO OOWO0G) 


123 
=— = 43.5 


V/8 


The value of Student’s ¢ is 85/43.5, or 1.95. 

As mentioned previously, the test of any comparison, as made above, 
is identical with the F-test of the corresponding component in the 
analysis of variance. The basic relationship is ” = (?; that is, if the 5% 
values of ¢ for n degrees of freedom are read from the table and squared, 
their squares are the 5% F values for 1 and n degrees of freedom, and 
similarly for any other significance level. This result may be verified in 
the present instance. The value of is 3.80, with 36 df. From the 
analysis of variance in table 3.5, the / value for the average linear re- 
gponse is 57,207/15,130 or 3.78, with 1 and 36 d.f., the two values 
agreeing apart from rounding errors. The reader may check that the 
same agreement holds for the average curvature. 

Example 2. Response to the single dressing of (CN). The response is 
measured by 


z= 95 — Yo 


Once again rule 5b is appropriate. 


1 1 123V5 
s; = 123 ,/-+—= = 68.8 
Laat 4 


Example 3. Difference between the linear responses to (CN) and (CS). 
In section 3.48 it was shown that this difference is 
vad Gs — Je + 291 — 292] 
5 


Since all means have 4 replicates, rule 5a may be used. 


o = oo VEU = (12.3)(V10) = 38.9 
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Example 4. Although the experiment is not complex enough to ex- 
hibit a profitable application of rule 6, this example shows that the rule 
agrees with the other rules. Suppose that at the double level we had 
compared (CN) with (CS), and also the two chemicals with the two 
proprietary mixtures. The comparisons are 


father 


A=f-% @2= 
1 ; 2 2 2 
By rule 5a their standard errors are 
o o 
o, => V2; ce i 2 


and it will be noted that the two functions are orthogonal. 
If now we wished to compare the double dressing of (CN) with the 
mean of the 3 other double dressings, the comparison would be 


Jo + Is + Ya 
3 


Se 


By a direct use of rule 5a, we have 


ie: ae eee ey, 
Cala oIN| @ gone One 3 


But alternatively we may write 


and apply rule 6 to give 


an pete OO *QO-a 


in agreement with the more direct method. 


3.53 Testing Effects Suggested by the Data. In order that /- and t- 
tests be valid, the tests to be made in an experiment should be chosen 
before the results have been inspected. The reason for this is not hard 
to see. If tests are selected after inspection of the data, there is a natural 
tendency to select comparisons that appear to give large differences. 
Now large apparent differences may arise either because there are large 
real effects, or because of a fortuitous combination of the experimental 
errors. Consequently, in so far as differences are selected just because 
they seem to be large, it is likely that an undue proportion of the cases 
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selected will be those where the errors have combined to make the dif- 
ferences large. The extreme case most commonly cited is that of the ex- 
perimenter who always tests, by an ordinary ¢-test, the difference be- 
tween the highest and the lowest treatment means. Tf the number of 
treatments is large, this difference will be substantial even when the 
treatments produce no real differences in effect. It may be shown that 
with 3 treatments the observed value of t will exceed the 5% level in the 
table about 13% of the time. With 6 treatments the figure is 40%, with 
10 treatments 60%, and with 20 treatments 90%. When the experi- 
menter thinks that he is making a ¢test at the 5% level, he is actually 
testing at the 13% level, or the 40% level, and so on. To summarize, 
the selection of those differences that look large and therefore “in- 
teresting” invalidates the ordinary tests of significance. The effect is to 
obtain too many significant results, or to raise the significance level of 
the test from the presumed 5% to some higher level, usually unknown. 
On the other hand, the rule that no tests must be constructed after 
seeing the data seems contrary to sound scientific practice. Often the 
initial experiments in a line of research are conducted in order to “‘see 
what happens”; the research worker does not know in advance what 
comparisons he may wish to test. And even where hypotheses about 
the nature of the results can be set up, experiments often indicate 
strongly, as is later confirmed, that these hypotheses are erroneous and 
that the method of analysis based on them is inappropriate. Con- 
sequently, although the effects of selection cannot be ignored, they should 
not deter the experimenter from the most careful examination of his 
data. Any difference that is of interest, whether anticipated or not, 
should be tested. If the ordinary tests show that the difference is not 
significant, then at least we know that the difference can be accounted 
for without supposing any real effect. If an unexpected difference 7s 
statistically significant, the possibility of a selection effect should be 
borne in mind. When the point at issue is important, the best pro- 
cedure is to conduct a new experiment specifically designed to confirm 
or disprove the indications from the previous results. 
The reporting of such effects presents difficulties. The following is a 
suggestion. “The experiment also indicated an unexpected effect of 
Although this was statistically significant at the 5% level, it is 
contrary to previous experience, and must at present be regarded with 
caution. Further work on this point will be undertaken.” The diffi- 
culties with this kind of writing are that we do not wish to imply that 
the effect is definitely established, and yet the writing should not en- 
courage the reader to ignore any result that is contrary to our precon- 


ceived notions. 
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3.6 Subdivision of the Sum of Squares for Error 


3.61 Reasons for Subdivision. The sum of squares for error can be 
partitioned into components in the same way as that for treatments. 
Although such subdivisions are not so frequently required as with treat- 
ments, they have a number of uses. Sometimes there is reason to believe 
that the error s.8. is not homogeneous; that is, the residual errors e do 
not all have the same variance o? as postulated in the mathematical 
model, In this case, as will be shown in section 3.63, subdivisions of the 
error may be necessary in order to obtain valid ¢tests. Occasionally a 
gross error in some observation may be suspected, and it is helpful to 
calculate the contribution of this observation to the error s.s. In ad- 
dition, a subdivision may help in understanding the nature of the error 
m.3s, 


3.62 Rules for Subdivision. These follow the same general pattern 
as with treatments, though they are a trifle more complicated. It is 
necessary to go back to the normal equations (3.4) from which the treat- 
ment and block effects were estimated. These equations took the form 


Ds Ope m — t; — bj) =0 (8.4) 


In other words, the residuals that provide the estimates of error must 
add to zero over any treatment or over any block. Consequently, if the 


linear function 
2 = Dd lisnyisn 


is to be a component of the error, the coefficients 1;;, must sum to zero 
over all observations that receive any specified treatment and also over 
all that are in any specified block. These are the tests by which we tell 
whether any proposed 2 is part of the error. 

For any z which satisfies these tests, the contribution to the error 8.8. 


is z2/D, where by rule 5 
D= Dik 


We now examine some components of the error s.s. in the eelworm ex- 
periment. Obviously, the difference z between the eelworm numbers on 
any two control plots in the same block satisfies the conditions. Sim- 
ilarly any comparison among the 4 control observations in a block is a 
component of the error. It follows that the sum of squares of deviations 
of these 4 observations from their mean contributes 3 d.f. to the error 
ss. The 4 blocks together contribute 12 df. of this type of error. The 
sum of squares will be found to be 307,312. This component is a meas- 
ure of the amount of variation among observations that receive the same 
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treatment and lie in the same block, and is the type of component that 
we should expect to constitute the error. However, this type of com- 
ponent is encountered only when some treatments have been replicated 
within the block. 
Consider now the difference between two other treatments, say (1CV) 
and (1C'S). Let 
a = (1CN); — (1CS)1 


where the subscript 1 denotes that the two observations come from 
block 1. This function is not a component of the error, for, while the 
coefficients add to zero over every block, they do not do so over treat- 
ments (ICN) and (10S). But if we take 


Z1 — 22 = (1CN); — (1CS); — (1CN)2 + (1CS)2 


the conditions are satisfied. This expression shows how the difference 
between two treatments changes from one block to another. It is a 
measure of the effect of the blocks on the treatment difference, and is 
usually called an interaction of treatments with blocks. Such inter- 
actions are the typical components of error. 

If z; is the difference between (1CN) and (1CS) in the ith block, any 
comparison among the 2; satisfies the rules required for a component of 
the error. Hence the sum of squares of deviations of the z;, when divided 
by 2, contributes 3 d.f. to the error s.s. A similar result holds if we take 
any comparison among the treatments and calculate it separately for 
each block. Since the 9 treatments provide 8 independent comparisons, 
we see that the 24 d.f. for the interactions of treatments with blocks may 
be divided into 8 sets of 3 d.f. The sum of squares for these 24 d.f. can 
of course be found by subtraction as shown in table 3.7. 


TABLE 3.7 Susppivision OF THE SUM OF SQUARES FOR ERROR 


df. 8.8. m.s. 
Error 36 544,690 15,180 
Among controls 12 307,312 25,609 
Treatments X blocks 24 237,378 9,891 


The F-ratio of the two components of error is 2.59, with 12 and 24d.f. In 
the ordinary table this corresponds to a significance level of about 244%. 
It is probably more correct to make a two-tailed test by doubling this 
probability. Hither test indicates that the controls are more variable 
than the other treatments. This is not very surprising, since in data of 
this type the variance may tend to increase as the mean increases. In 


oc —— 
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any event the result throws doubt on the suitability of the original 
mathematical model; it would seem better to ascribe different error 
variances to the control and the fumigants. 

The control also contributes to the treatments X blocks component 
of the error. The 8 comparisons among the 9 treatments can be divided 
into 1 between the control and all fumigants, and 7 which are com- 
parisons among the fumigants themselves. The former will contribute 
3 df. to the treatments X blocks interaction; the latter, 21 d.f. Tf the 
control is more variable than the fumigants, we would expect the mean 
square for the 3 d.f. to be larger than that for the 21 d.f., though it should 
be remembered that a mean square with only 3 df. is poorly determined. 
In the next section these two components will be computed separately. 


3.63 Calculation of a Separate Error for a Treatment Comparison. As 
the eelworm example illustrates, certain treatments may be erratic in 
their effects, while others in the same experiment show more stable re- 
sults, When this occurs, comparisons involving the erratic treatments 
have a higher experimental error variance than those among the more 
stable treatments. Since the error m.s. in the analysis of variance is some 
weighted average of these different variances, its use for individual 
tests may not be valid. In such cases it is helpful to be able to calculate 
a separate error for any specific treatment comparison. The procedure 
will be illustrated for the comparison between the control and the 
fumigants. 

The first step is to compute the comparison separately for each block. 


Totals in block 


1 2 3 4 Total 
Control 1814 1295 1769 980 5858 
Fumigants 2569 1780 2983 1490 8822 
2 (Control) —(Fumigants) 1059 810 555 470 2894 


Since there are 4 control plots and 8 fumigated plots in a block, the con- 
trol total is multiplied by 2 in forming the comparison. The contribu- 


tion to the error s.8. is 
(1059)? -+ (810)? + (555)? + (470)? ef (2894)" ohead 
24 96 
The divisor 24 is found by the usual rule. The mean square is 2954, 


with 3 d.f. By subtraction, the mean square for the interaction of fu- 
migants with blocks (21 df.) is found to be 10,882. Contrary to an- 
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ticipation, the component involving the control has the smaller mean 
square, though the difference does not approach significance. 

As often happens when the simple model is inadequate, the best 
approach for a more accurate analysis is questionable. It is possible 
to compute a separate error, by the method just given, for each ¢-test 
that is made. This is the procedure least open to criticism, since it 
makes no assumption that errors are constant from one treatment to 
another. Since, however, each t-test would be based on only 3 d.f., 
these tests are insensitive, so that this method should not be used with- 
out good reason. In the present case it is probably justifiable to pos- 
tulate only two error variances. The first, appropriate to the controls, 
is estimated by the mean square 25,609, with 12 d.f.; the second, for the 
fumigants, by the mean square 10,882, with 21 d.f. By this approach 
the variance of the mean response to the double dressing, for example, 
is estimated as 


10,882 25,609 
——_ + —— = 2281 


since each mean is taken over 16 plots. 

Separation of the error s.s. into single components is seldom required 
except in special studies or as an exercise. One method is to set up 8 
orthogonal comparisons among the treatments. Calculate each com- 
parison separately in each block, and let z;; be the value obtained for the 
ith comparison (¢ = 1, 2, ---, 8) in the jth block. Then find 


Way = 21 — 225 Wig = Zi + 22 — 22135 
Wig = ein + 29 + Zig — Seis 
Fach w is a component of the error, and the whole 24 w’s are mutually 


orthogonal. The squares of the w’s, each with its proper divisor, give a 
separation of the error s.s. into single components. 


3.7 Missing Data 


3.71 Method of Handling Missing Data in the Analysis. From time 
to time certain observations are missing, through failure to record, gross 
errors in recording, or accidents. The omissions naturally affect the 
method of analysis. With each of the common designs we give computa- 
tional instructions for analyzing data that contain gaps. The object of 
this section is to indicate the theoretical basis for these methods. 

When certain observations are absent, the correct procedure is to 
write down a mathematical model for all observations that are present. 
The least squares Normal equations are then constructed in the usual 
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way. These take exactly the same general form as when all observa- 
tions are present; i.e., 

DY Win —m — ti — bj) = 0 (3.20) 
over all observations whose equations contain any specified parameter 
that is to be estimated. Since, however, the terms in the equation cor- 
responding to missing observations are absent, the system of equations 
loses some of the symmetry that it possesses when all observations are 
present, and the solutions are more difficult. The same general procedure 
supplies F- and t-tests of hypotheses about the nature of the treatment 
effects as described in section 3.24, though again the details become more 
complicated. 

In the analysis of variance, two changes may be noted. Owing to 
the missing observations, the treatments and blocks s.s. become en- 
tangled, so that the treatments 5.5. must be computed after allowing 
for block effects, as mentioned at the end of section 3.26. Secondly, 
if a observations are absent, the total number of degrees of freedom 
is reduced by a. Unless one or more complete treatments or blocks 
is missing, the number of parameters required to describe these effects 
will be the same as before. Consequently, the missing degrees of free- 
dom all come from the error s.8., which now represents (%~¢ — a) degrees 
of freedom. In short, missing data may be handled by applying the 
standard least squares procedure to all observations that are not miss- 
ing. For future reference, this method will be called the “correct least 
squares procedure.” ' 

To the experimenter it may be a difficult. business to carry out the 
construction and solution of a set of unfamiliar normal equations, even 
though he is quite competent to analyze a set of complete data. For this 
reason Yates (3.9), following a suggestion by Fisher, considered inserting 
values for the missing observations 80 as to obtain a set of complete data. 
Suppose that only a single observation is missing, and that a value a is 
substituted for this observation. If the analysis of variance is calculated 
in the usual manner for complete data, the error s.8. is found to be of the 
form 

Aa? — 2Ba + C 
where A, B, and C are numbers that are determined by the type of de- 
sign and the values for the other observations (A is always positive). 

In order to find a numerical value for z, Yates proposed to use the 
value that minimizes the error s.s. This isz = B/A. If this value is in- 
serted in place of the missing observation, and if the data are analyzed 
as if no observations were absent, Yates showed that several important 
properties hold. (i) The estimates of treatment and block effects are 
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exactly the same as those obtained by the correct least squares pro- 
cedure. (ii) The error s.s. is exactly the same as given by the correct 
procedure. (iii) To obtain the correct partition of the degrees of freedom, 
we subtract 1 from the total s.s. and 1 from the error s.s. 

Yates also showed that the method of insertion fails to agree with the 
correct least squares procedure in two respects. The treatments s.s., as 
obtained in the analysis of variance of the “complete” data, is always 
slightly larger than the correct treatments s.s. for an F-test of the treat- 
ments. Unless an appreciable fraction of the total observations is miss- 
ing, this overestimation is unlikely to be large; further, the exact /’-test 
can be obtained by means of some additional calculations. The second 
defect of the method of insertion is that it may not give proper é-tests. 
That this will happen is clear, because in the analysis of “complete” 
data r replications are ascribed to the treatment that contains the missing 
observation, whereas there are only (r — 1) replications. To allow for 
this disturbance we give special rules which provide ¢-tests that are ap- 
proximately correct. 

Thus the method adopted with a missing value is first to “estimate” 
this value by means of the formula B/A, which will be presented for the 
common designs. This estimate is used in place of the missing value, and 
the rest of the analysis is conducted as if the data were complete (except 
for the changes in degrees of freedom). Special methods are available 
for exact F-tests and for t-tests. If several observations are absent, a 
repeated application of the formula enables values to be substituted for 
each missing observation. 

This method is essentially an ingenious computational device whose 
purpose is to enable the easy computations that apply to complete data 
to be used even when data are incomplete. Substitution of estimates 
for the missing data does not in any way recover the information that is 
lost. through loss of data, as some experimenters have suggested, usually 
facetiously; it merely attempts to reproduce the results obtained by an 
application of the least squares method to the data that are present. 
‘The only complete solution of the ‘‘missing data” problem is not to have 
them. 


3.8 The Analysis of Covariance 


3.81 Purpose of the Technique. As indicated in section 3.2, experi- 
ments can be planned so that certain types of environmental effect are 
eliminated from the estimates of the treatment effects, with the result 
that these estimates are made more accurate. In the eelworm experi- 
ment, where the plots were grouped into blocks of 12 plots each, any dif- 
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ferences from block to block in the severity of eelworm infestation were 
eliminated in this way. On the other hand, differences in infestation 
from plot to plot within the same block are not controlled by the design and 
do contribute to the experimental errors, since treatments were assigned 
at random to plots within each block. Accordingly, before the fumigants 
were applied, samples were taken in order to estimate the natural in- 
festation on each plot. 

The analysis of covariance shows how to use these supplementary data 
to reduce the experimental errors by eliminating the effects of variations 
in the initial infestation within a block. The technique is potentially 
very useful. It often happens that some source of variation which can- 
not be controlled by the design can be measured by taking additional 
observations. Whenever this is so, the analysis of covariance can be 
utilized, often to great advantage. One caution is that, since the ad- 
ditional observations are to measure environmental effects, they must not 
be influenced by the treatments. The situation where such measure- 
ments are influenced by the treatments is discussed in section 3.88. 


3.82 Initial Steps in the Analysis. The first step is to construct a 
new mathematical model. If y;; refers to the final eelworm count and 
xijz to the initial, the relation is 

Yer = w+ Ti + By + Vize — B) + Cain (3.21) 
The only change is the introduction of a new term to describe the effect, 
of the initial eelworm number. We have assumed that the effect is 
linear, i.e., it is a constant multiple y of the amount by which the initial 
eelworm number 2; on the plot differs from the average initial number 


& for the whole experiment. 
As before, the unknowns are estimated by least squares. In this 


case we minimize 


D [yee — om — ts — bj — cee — DP (3.22) 
The normal equation for m is again of the form 
D lyn — m — te — 8; — cin — D] = 0 (3.23) 


over all observations. If the same linear restrictions as before are ap- 
plied to the ¢; and the b;, equation (3.23) implies that m is the mean of the 
yijx. The equation for ¢; is the same as (3.23) except that the sum is over 
those observations that receive t;. The equation may be rearranged as 


ram + riti = Tiy — (Tic — TH) (3.24) 
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where 7; is the number of replications, and 7’ denotes a treatment total. 
This gives 


vey est Giae (8.25) 
(2-4) 


The important feature of this equation is that in order to obtain ¢;, the 
observed treatment mean of the y’s (T'z,/r;) is adjusted, the adjustment 
depending on the treatment mean of the 2’s. It is this adjustment that: 
removes the effect of the initial infestation. 

The equation for c, the adjustment factor, will not be developed in 
full. It turns out that ¢ is given by the ratio of the error sum of products 
of y and z to the error s.s. of x. 


3.83 Computations. In practice we start with a joint analysis of the 
sums of squares and products of y and x. In order to illustrate certain 
features of the covariance technique, the treatments s.s. will be sub- 
divided into “linear” and “curvature” components, as was done for y 
in table 3.5 (p. 64). The original values for x are given above the y 
values in table 3.1 (p. 40). The treatment totals and the quantities 


TABLE 3.8 TREATMENT TOTALS FOR NUMBERS OF CYSTS * 


Before fumigation (x) After fumigation (y) 


Level CN CS CM CK Total) CN CS CM CK Total 


0 1975 1975 5858 5858 
1 402 417 513 570 1902 | 1066 928 1431 892 4317 
2 889 554 568 778 2289] 1265 877 1241 1122 4505 


2(2) + (1):L | 1180 1525 1649 2126 6480 | 3596 2682 3913 3136 13327 
2(1) — (2):Q| 415 280 458 362 1515 | 867 979 1621 662 4129 


* Numbers of cysts per plot were the numbers found in 400 grams of soil. Thus 
the totals in the table represent 1600 grams of soil, except for the no-fumigant totals, 
which represent 6400 grams. 


needed for isolating the linear response and the curvature are reproduced 
in table 3.8. The analysis of the sum of squares of x follows by the 
same methods as given previously for y. 

To analyze the sum of products, we carry out the same operations as 
for a sum of squares, except that at every stage a square is replaced by 
the corresponding product. A few examples will suffice. 


3.83 COMPUTATIONS 7 


Total: (269)(466) ++ (288)(280) +- -- + (48)(298) — e164) = 355,929 
Trestaients: (1975)(5858) , (402)(1066) Bhs, psc (778)(1122) 
16 4 4 
= woe = —9299 
Average linear response: (2280. a1 970) S500 iDene) = —13,276 


32 
The complete analysis of covariance is shown in table 3.9. 


TABLE 3.9 SuMS OF SQUARES AND PRODUCTS (% = BEFORE, y = AFTHR, 


FUMIGATION) 
af. (@’) (yx) (y) 
Blocks 3 159,618 175,873 280,427 | 
Treatments 8 29,142 —9,222 157,448 
Average linear response uf 3,081 —13,276 57,207 
Average curvature 1 2,204 8,285 31,140 
Differences in linear 3 22,975 —6,837 43,408 
Differences in curvature 3 882 2,606 25,693 
Error 36 121,408 189,278 544,690 
Total 47 310,168 355,929 991,565 


If we denote the sums of squares and products for error by Brx, Byy, 
and H., respectively, the adjustment factor ¢, or regression coefficient 
of y on 2, is given by 

Ey, 189,278 


Ey, 121,408 
It may seem surprising that the coefficient is greater than 1. This is 


probably explained by the great seasonal increase in the eelworm num- 
bers, obvious from table 3.8. 

The residual error s.s. may now be found. The original error 8.8. is 
544,690, with 36 df. To remove the effect of the regression on the 
initial eelworm numbers, we subtract 

Ey2 (189,278)? 
E,, 121,408 


2 
Eyz 


= 1.559024. 


= 


= 295,089 


By — —“— = 544,690 — 295,089 = 249,601 


LE 


Thus the residual s.s. is 249,601, with 35 d-f., since 1 df. must be sub- 
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tracted for the additional parameter y. The residual m.s. Spe a8 
249,601/35, or 7131. This is less than half the original mean square 
of 15,130, and indicates that the use of covariance has approximately 
doubled the accuracy of the experiment. 

One feature of the covariance method deserves comment at this point. 
In the model the effect of z was assumed to be linear, but no assumption 
was made about the strength of the effect. That is left to be determined 
by the data. If the eelworm numbers on each plot had remained un- 
changed throughout the season, except for the influence of the treat- 
ments, and if the eelworm numbers had been accurately measured, the 
residual s.s. would have been zero. The residual s.s. as found presumably 
represents a contribution due to seasonal variations in infestation and 
one due to the fact that the eelworm numbers were not estimated ac- 
curately, being obtained only from small samples of soil. If the variable 
x had had no linear effect on y, the residual m.s. would have been the 
same as the original mean square, apart from sampling fluctuations. 


3.84 The Adjusted Treatment Means. A little time is saved if the 
adjusted treatment totals are found first. From equation (3.24) the 
adjustment to the ith treatment total of y is 


—¢e(Tiz — TH) 
where r; is the number of replicates, and & the general mean of x (128.46). 
Thus for the control the adjusted total is 
5858 — 1.5590(1975 — 16 X 128.46) = 5858 + 125 = 5983 
and for (ICN) 
1066 — 1.5590(402 — 4 X 128.46) = 1066 + 174 = 1240 
The means are shown in table 3.10. 


TABLE 3.10 Apsusrep TREATMENT MEANS (BELWORMS PER 400 GM. SOIL) 


Fumigant 
Level of 
dressing CN cs CM CK 
0 374 
Aj 310 270 358 201 
2 365 204 289 178 


3.85 t-tests. These tests are slightly complicated by the fact that 
account must be taken of the sampling error of the adjustment factor c. 
The difference between two adjusted means may be written 


Ti — Jo — ci — 4) 
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From regression theory the variance of this quantity is given by 


my file sl 1 ot Geo wey? 
ove (. ap V+ G Be) o2 oe | fey =| (3.26) 


ry TT TY T2 Exe 


where Tyee is the residual error variance, and ,, the error s.s. for x. As 
an estimate of o,.,” we use the residual error m.s., in this case 7131 with 
35 df. 

Example. A t-test of the reduction to the double dressing of CS. The re- 
duction is (374 — 204), or 170 eelworms per sample. The estimated 
variance is 


1 Hn (123.44 — 138.50)? 
4 121,408 


1 
(7131) Fe ap | = (7131)(0.31437) = 2242 


(3.27) 
the means of the a’s being found by division from table 3.8. Hence 
ee ase 
2242 = 47.35 


with 35 d.f., which is highly significant. 
For a more complicated comparison of the adjusted means, 


D Wy: — e@: — 9] 
the estimated variance is 
1? 1%; o 
Sie [= —+ — (3.28) 
"% Ezz 


(The term in € vanishes because for any comparison the sum of the l’s 


must be zero.) 

One annoying feature of these tests is that the x values enter into the 
variance, so that every comparison necessitates a separate computation 
of the variance. As a time-saving approximation, Finney (3.10) has 
proposed that an average value for the contribution of the term in the 
2’s may be used. This amounts to using 


3643 
121,408 


Sys [1 te =| = (7131) [! a. | = (7131)(1.030) = 7345 


a 


as the effective residual error m.s., where tx is the treatments mean square 
for x. The term in brackets represents the average contribution from 
the a’s, or in other words the contribution from sampling errors in the 
adjustment factor c. 
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Thus, for the variance of the difference between the adjusted means 
of the control and 2CS, we would use 


(7345) (- es *) = (7345) ( 4 ) = 2295 
ae Se Gey) 


instead of the value 2242 given by the more exact expression (3.27). 
Similarly, instead of (3.28), we use 


12 
(7345) > () 
‘% 
This approximation is usually good enough if the number of error de- 
grees of freedom exceeds 20, since in such cases the contribution from 


errors in cis small. The more exact test may be used if n, is small. 


3.86 f-tests. The fact that the same adjustment factor appears in 
every treatment mean also influences F-tests. The procedure is shown 
in table 3.11. 


TABLE 3.11 F-rest wird THE ANALYSIS OF COVARIANCE 


Residuals 
a 
df. (2?) (y2) (y’) | af. 8.8. ms. 
Treatments 8 29,142 —9,222 157,448 | 8 237,192 20,649 
Error 36 121,408 189,278 544,690 | 35 249,601 7,131 
T+E 44 150,550 180,056 702,188 | 43 486,793 


The figures to the left: of the vertical line are from the previous analysis 
(table 3.9), and in practice would not be recopied. Form a new line in 
the analysis by addition of the items for treatments and error. From 
this value for (y’), ie., 702,138, subtract the contribution due to a re- 
gression on x, 

(180,056) 


= 215,345 
150,550 yi 


The remainder, 486,793, is entered in the column headed “residuals 8.8.” 
and carries 43 df., 1 being subtracted for the regression. The same 
process is completed for the error line; actually, this was already done in 
computing the residual error s.s. The residual s.s. for treatments is 
found by subtracting that for error from that for treatments + error 


237,192 = 486,793 — 249,601 
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It always has the same number of degrees of freedom as the original 
treatments s.s. The F-test of the adjusted treatment means is given by 
the ratio of the residual m.s. 


with 8 and 35 df. 

If it is desired to test some component of the treatments s.s., the same 
calculation is made with the component in place of the treatments s.8. 
throughout. If several components are to be tested, this becomes rather 
tedious. A useful approximation is to construct from the original anal- 
ysis of covariance in table 3.9 an analysis of (y — cx)”. This is most 
easily done by multiplying each term in (x?) by c”, or 2.43048, each term 
in (yx) by —2c, or —3.11805, and adding the two products to (y*). The 
results are shown in table 3.12, , 


TABLE 8.12 ANALYSIS OF VARIANCE OF (y — CX) 


df. 8.8. m8. PF Ff 
‘Treatments 8 257,032 32,129 
Average linear response 1 106,090 106,090 14.88 14.51 
Average curvature 1 10,664 10,664 1.50 1.47 
Differences in linear 3 120,566 40,189 5.64 5.05 
Differences in curvature 3 19,711 6,570 0.92 0.92 


Error 85 249,592 7,131 


It will be noted that this calculation gives the correct residual error 
m.s. However, sums of squares for components of the treatments are 
always larger than those given by the more roundabout correct pro- 
cedure. Thus the F values shown on the right are all too large. The 
overestimation is seldom great, and the 7’ values serve for a preliminary 
inspection. Those that are just beyond the significance level may be 
recomputed by the correct procedure if it is thought worth while. For 
comparison, the /’ values shown above are those obtained by the 
correct procedure (it is a useful exercise to check them). With either 
F or F’ there is a significant average linear response and significant 
differences among fumigants in their linear responses, while the curva- 
ture terms do not approach significance. In the analysis made without 
covariance (table 3.5, p. 64) no component was significant. 


3.87 The Increase in Accuracy Due to Covariance. From table 3.9 
we see that with no covariance the error 8.5. for y is 544,690, with 36 
df., giving an error m.s. of 15,130. A comparable figure when covariance 
is used is the effective residual m.s., 7345. This is preferable to the 
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residual m.s. itself (7131) because it makes allowance for the sampling 
error of the adjustment factor c. The accuracy obtained with covariance 
relative to that without covariance is estimated by 15,130/7345, or 2.06. 
The use of covariance appears to have had about the same effect as 
doubling the number of replicates. In making this comparison, we ig- 
nored the effect of the reduction in error d.f. from 36 to 35, because it is 
negligible. 

As in ordinary regressions, the x variable may be transformed to an- 
other scale if this is likely to produce a more linear relation with y. The 
use of log x, for instance, is common in biological work. Two or more 
different « variables may be used. The calculations for this case are 
described by Snedecor (3.4, chapter 13). 


3.88 The Case Where the x Variable is Influenced by the Treatments. 
Ina covariance analysis the treatment mean #; is adjusted by the amount 
—c(& — #). The effect of the adjustment is to change each #; to the 
value that it would be expected to have if all treatments had the same 
«mean. It is in this way that the technique removes the effect of 
variations in the #;. If, however, the differences among the #; are in part 
produced by the treatments, the adjustment removes part of the treat- 
ment effect and its interpretation is changed. Consequently, in the 
standard use of covariance, it is important to be sure that the treatments 
did not affect the « values. Sometimes this is obvious, as in the eel- 
worm experiment, because the x values were recorded before the treat- 
ments were applied. A more doubtful case is that of the number of 
plants per plot in a field experiment, counted after the application of 
the treatments, which may or may not influence plant numbers. An /’- 
test of the « values is helpful in such cases. 

Where the treatments do affect 2, a covariance analysis may add in- 
formation about the way in which the treatments produced their effects. 
In the eelworm experiment, the yields of the oats which were grown on 
the plots were obtained. Since eelworms attack oats, it would be in- 
teresting to know whether the effects of the treatments on the oats were 
simply a reflection of their effects on the eelworms. This is examined 
by a covariance analysis in which the oats yields are the y values, and 
the eelworm numbers after harvest are the x values. If the F-test of the 
adjusted treatment means still shows significance, the conclusion is that 
not all the treatment effect on the oats can be attributed to the reduction 
in eelworm numbers. This happened in the present instance, because 
some treatments supplied nitrogen to the crop, and therefore acted in 
part as fertilizers as well as fumigants. As Bartlett (3.11) has pointed 
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out, the interpretation of this use of covariance requires care, since a 
hidden extrapolation may be involved. 

When the x variables are influenced by treatments, the short-cut 
method for ¢-tests by use of the effective residual error and the approxi- 
mate F-tests by means of an analysis of (y — cr) should be avoided, 
since they may be seriously in error. 


3.9 Effects of Errors in the Assumptions Underlying the Analysis of 
Variance 


The assumptions made in the analysis of variance are that treatment 
and environmental effects are additive, and that the experimental errors 
are independently distributed in the normal distribution, with a common 
variance. In practice we can never be sure that these assumptions all 
hold, and often there is good reason to suspect that some are false. The 
consequences of failures in the assumptions and the remedial steps to 
be taken have been summarized recently by Hisenhart (3.12), Cochran 
(3.13), and Bartlett (3.14). Only a few comments will be given here. 

Asa rule, the failure of an assumption will affect both the significance 
levels and the sensitivity of F- and ¢tests. When the experimenter 
thinks that he is testing at the 5% level, he may actually be testing at 
the 8% level. Usually, though not invariably, the true significance 
probability is larger than the apparent one; that is, too many significant 
results are obtained. Also, there is usually a loss of sensitivity, in the 
sense that a more powerful test than the analysis of variance Fest 
could be constructed if the correct mathematical model were known. 
There is a corresponding loss of accuracy in the estimates obtained for 
the treatment effects, since these, too, could be made more accurate if the 
correct, model were known. 

Although generalization is hazardous, experience suggests that in the 
majority of experiments, at least in the field of biology, these disturb- 
ances are not sufficiently great to invalidate the technique. They do 
imply, however, that significance levels and confidence limits must be 
considered approximate rather than exact. For the same reason, the 
inflexible use of say the 5% significance level to divide the effects into 
those that are regarded as “‘real” and those that are not is hardly jus- 
tifiable. 

The most serious disturbances appear to arise when the experimental 
error variance is not constant over all observations. Sometimes this 
happens, as mentioned previously, because certain treatments are erratic 
in their effects. In such cases, the appropriate error variance for com- 
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paring one pair of treatments might be four times as large as that for 
another pair, and the use of the same estimated variance for both com- 
parisons would lead to t-tests that were completely erroneous. Where 
this type of disturbance is suspected, the remedy is to divide the error 
s,s. into components each of which is homogeneous, as indicated in 
section 3.63. 

The same problem may arise because the experimental errors follow a 
distribution that is decidedly skew. In such distributions the error 
variance for a treatment tends to be a function of the mean produced by 
the treatment. If the nature of the functional relationship is known, a 
transformation can be found that will place the data on a scale on which 
the error variance is more nearly constant. This transformation is then 
made on the observations before starting the analysis. The principal 
transformations that have been found useful are discussed by Bartlett 
(3.14); they include logs, square roots, and (for data expressed in frac- 
tions) inverse sines. 

Such transformations may also be useful in cases where treatment and 
environmental effects are not additive. If, for instance, a treatment in- 
creases all observations by 20%, irrespective of the initial level, a change 
to logs will introduce additivity. When transformations are made for 
this purpose, it should be realized that they will also affect the distribu- 
tion of the experimental errors. Fortunately, it often happens that such 
transformations also bring the distribution of errors closer to normality. 

Finally, the assumption that the errors are independent from observa- 
tion to observation may be obviously untenable. It is well known that 
crop yields on neighboring plots tend to be positively correlated, and in 
laboratory experiments observations made by the same person at about 
the same time tend to exhibit the same type of correlation. These cor- 
relations might completely vitiate tests of significance. The remedy in 
this case is the proper use of randomization, which, as it were, introduces 
independence in the assignment of treatments to the experimental units 
or in the assignment of the order in which observations are made, so that 
the errors may effectively be regarded as independent. For further dis- 
cussion of this question, see Yates (3.15), Fisher (3.3), Bartlett (3.16), 
and Welch (3.17). 
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CHAPTER 4 


COMPLETELY RANDOMIZED, RANDOMIZED BLOCE, 
, AND LATIN SQUARE DESIGNS 


4.1 Completely Randomized Designs 


4.11 Description. The simplest type of layout is that in which treat- 
ments are allotted to the units entirely by chance. More specifically, if 
a treatment is to be applied to four units, for example, the randomization 
gives every group of four units in the experimental material an equal 
probability of receiving the treatment. In addition the units should be 
processed in random order at all subsequent stages of the experiment 
where this order is likely to affect the results. 

This design has several conveniences: 

1. Complete flexibility is allowed. Any number of treatments and 
of replicates may be used. The number of replications can be varied at 
will from treatment to treatment (though such variation is not recom- 
mended without good reason). All the available experimental material 
can be utilized—an advantage in small preliminary experiments where 
the supply of material is scarce. 

2. The statistical analysis is easy even if the numbers of replicates are 
not the same for all treatments or if the experimental errors differ from 
treatment to treatment. 

3. The method of analysis remains simple when the results from some 
units or from whole treatments are missing or are rejected. Moreover, 
the relative loss of information due to missing data is smaller than with 
any other design. 

The principal objection to a completely randomized design is on the 
grounds of accuracy. Since the randomization is not restricted in any 
way to ensure that the units which receive one treatment are similar to 
those which receive another treatment, the whole of the variation among 
the units enters into the experimental error. For this reason the error 
can often be reduced by the use of a different design, unless the units are 
highly homogeneous or the experimenter has no information by which 
to arrange or handle the units in more homogeneous groups. 

Complete randomization seems the obvious procedure for many lab- 
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oratory experiments, e.g., in physics, chemistry, or cookery, where a 
quantity of material, after thorough mixing, is divided into small sam- 
ples or batches to which the treatments are applied. On the other hand, 
these designs are seldom used in field experiments, the method of ran- 
domized blocks having been found consistently more accurate. 

One fact compensates to some extent for the higher experimental er- 
rors as compared with other designs. For a given number of treatments 
and a given number of experimental units, complete randomization pro- 
vides the maximum number of degrees of freedom for the estimation of 
error. As pointed out in section 2.31, the sensitivity of the experiment 
increases as the number of error degrees of freedom is increased. Con- 
sider, for example, an experiment with 2 treatments in 4 replicates. The 
degrees of freedom for error are 6 under complete randomization as 
against 8 if the method of pairing (section 4.2) is used. From section 
2.31 it appears that the paired experiment must produce about a 14% 
reduction in the error variance in order to offset the additional un- 
reliability in the estimate of error. This point is worth bearing in mind 
with small experiments. 

To summarize, complete randomization may be appropriate (i) where 
the experimental material is homogeneous, (ii) where an appreciable 
fraction of the units is likely to be destroyed or to fail to respond, and 
(iii) in small experiments where the increased accuracy from alternative 
designs does not outweigh the loss of error degrees of freedom. 


412 Randomization. If the number of units does not exceed 16, 
tables 15.6 and 15.7 may be used. Suppose that there are 3 treatments, 
of which two have 4 replicates while the third has 8 replicates. Num- 
bers 1-16 are assigned to the units in any convenient order. A random 
permutation is drawn from table 15.7, say 


9, 13, 8, 5, 12, 1, 14, 16, 6, 7, 3, 4, 10, 11, 15, 2 


The first treatment is applied to units 9, 13, 8, 5; the second to units 12, 
1, 14, 16; and the third to the remainder. 

With more than 16 units in the experiment, numbered dises, beans, 
or a book of random numbers may be used, as described in chapter 15. 


4,13 Statistical Analysis. Table 4.1 shows some of the results of an 
experiment on the effects of applications of sulphur in reducing scab dis- 
ease of potatoes. The object in applying sulphur is to increase the acidity 
of the soil, since scab does not thrive in very acid soil. In addition to 
untreated plots which serve as a control, 3 amounts of dressing were 
compared—300, 600, and 1200 Ib. per acre. Both a fall and a spring ap- 
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plication of each amount was tested, so that in all there were 7 distinct 
treatments. The sulphur was spread by hand on the surface of the soil, 
and then disced in to a depth of about 4 inches. The quantity to be 


TABLE 4.1 Fred PLAN AND SCAB INDICES FOR A COMPLETELY RANDOMIZED 


Notation: F = fall, S = spring application, 0 = control. 


EXPERIMENT ON POTATOES 


F3 0 S86 | F12 | S6 | S12 $3 F6 
9 12 18 10 24 17 30 16 
0 83 | Fl2 | F6 $3 0 0 56 
10 ( 4 10 21 24 29 12 

F3 | 812 | F6 0 F6 | S12} F3 | F12 
i) 7 18 30 18 16 16 4 
83 0 $12 | 86 0 Fi2 0 F3 
9 18 17 19 32 5 26 4 


are the amounts of sulphur in 100 Ib. per acre. 


Source of variation 


Treatments 


Error 


Total 


Results grouped by treatments 


The numbers 3, 6, 12 


F3 83 F6 86 F12 
9 30 16 18 10 
9 a 10 24 4 

16 21 18 12 4 
4 9 18 19 5 

38 67 62 73 23 
9.5 16.8 15.5 18.2 5.8 

Analysis of variance 
df. 8.8. ms, 

t-1) = 6 972.3 162.0 
(N -—d) =25 1122.9 44.9 
(N —1) = 31 2095.2 


* Denotes significance at the 5% level. 


57 G=501 
14.2 


3.61 * 


analyzed is the “scab index.” This is, roughly speaking, the percentage 
of the surface area of the potato that is infected with scab. It is obtained 
by examining 100 potatoes at random from each plot, grading each po- 
tato on a scale from 0 to 100% infected, and taking the average. 
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The design was a completely randomized one, with 4 replications of 
each sulphur dressing and 8 replications of the control, which received 
extra replication in order to obtain a fairly good estimate of the natural 
infestation. Incidentally, a randomized blocks layout might have been 
superior. 

The computations are very simple. Let y denote an observation, T; a 
treatment total, @ the grand total, r; the number of replications of the 
ith treatment, N = >> 7; the total number of observations, and ¢ the 
number of treatments. 

Step 1. Find the treatment totals and the grand total. 

Step 2. The sums of squares are computed as follows. 


2 2 
Correction factor: C = s = - = 7843.8 
Total: 5 4? — C = (9)? + (12)? +++++ (4)? — 7843.8 = 2095.2 
2 181)2 2 2) ae 2 
Treatments: >) m C= { = + Boe8g) + ONS 7843.8 = 972.3 
Error: (Total s.s.) — (treatments s.s.) = 2095.2 — 972.3 = 1122.9 


The analysis of variance is given at the foot of table 4.1. The F- 
ratio for treatments is significant at the 5% level. From the treatment 
means it appears that all dressings had some beneficial effect, and that 
the fall application was more effective than the spring one, There is 
little or no evidence that the higher dressings were more effective than 
the lowest dressing. If the summary is conducted from this point of 
view, we should isolate and test two individual components of the treat- 
ments s.s.: (i) a component measuring the average effect of all dressings, 
(ii) a component comparing the fall and spring applications. The follow- 
ing computations are required. 

Average effect of sulphur. The total over all dressings is 320, repre- 
senting 24 plots. Since the control total, 181, represents 8 plots, the 
comparison is 

3(181) — 320 = 223 


The contribution to the sum of squares in the analysis of variance is 
(223)?/96, or 518.0. By the rules in section 3.42, the divisor, 96, is 
[(9)(8) + 24]. 
Fall versus spring application. The comparison is 
(38 + 62 + 23 — 67 — 73 — 57) = -74 

It is obviously orthogonal to the previous comparison. The contribu- 
tion to the sums of squares is (74)?/24, or 228.2. These calculations 
lead to the analysis of variance in table 4.2. 
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The remaining four components of the treatments s.s. must represent: 
comparisons among the levels of sulphur. The average reduction in scab 
due to sulphur is significant at the 1% level, while the superiority of the 
fall application is also significant. Differences among the levels show no 
sign of significance. 


TABLE 4.2 SusprvisioN OF THE TREATMENTS SUM OF SQUARES IN THE 
EXPERIMENT ON POTATO SCAB 


Source of variation d.f. 8.8. ms. F 
Treatments 6 972.3 162.0 3.61 * 
Control vs. sulphur i 518.0 518.0 11.54 ** 
Fall vs. spring application 1 228.2 228.2 5.08 * 
Comparisons among levels 4 226.1 56.5 1.26 
Error 25 1122.9 44.9 


* Denotes significance at the 5% level. 
** Denotes significance at the 1% level. 


The conclusions might be phrased as follows. “The. application of 
sulphur produced a significant decrease in the scab index, the averages 
being 22.6 for the untreated plots, 16.4 for plots with the spring applica- 
tion, and 10.2 for plots with the fall application. The fall application 
proved significantly better than the spring application. There was no 
indication that the higher levels of dressing were more effective than the 
lowest level.” It should be remarked that a continuation of this experi- 
ment and other experiments caused these conclusions to be modified, 
the higher levels having shown to better effect. 

A more extensive discussion of the analysis of experiments of this type 
is given by Snedecor (4.1). Note that, if certain observations are missing 
or have to be rejected, the method of computation remains exactly the 
same except for the slight complication that different treatments will 
usually have different numbers of replications. 

The practice of calculating a separate standard error for each treat- 
ment is still found in some types of experimentation. Unless the treat~ 
ments have different error variances, the use of a pooled error, as given 
by the analysis of variance, is recommended since more sensitive tests of 
significance are obtained. In cases where the experimental error appears 
to vary considerably from treatment to treatment, a test of homogeneity 
of the error variance, due to Bartlett (4.2), may be made. This test is 
also described in Snedecor (4.1, section 10.13). 

It is worth emphasizing that, even if the treatments are randomly as- 
signed to the units, the above methods of analysis do not apply when 
restrictions on the randomization are introduced at a later stage in the 
experiment. Consider a chemical experiment in which samples of liquid 
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from the same bottle are treated in four different ways. The samples, 
being mutually indistinguishable, are assigned at random to the treat- 
ments. It is, however, practicable to treat only a few samples during 
each work period, and the investigator decides to complete a single 
replication at each session. The result of this additional control is that 
the differences among replicates are eliminated from the experimental 
errors and must equally be eliminated from the estimated errors. The 
analysis should follow the method described in section 4.2. This point 
will arise frequently in experiments where many physical operations 
must be carried out. 


4,14 Standard Errors. The estimated standard error of the difference 
between two treatment means is 


Sq = #(-+-) 
d 
uty cP} 


This formula applies when a pooled error is used, s” being the pooled 
error m.s. per unit, and 7; and rz the numbers of replicates for the two 


|2s” 
treatments. If 7, = 72 = 7, the formula reduces to a The degrees 


of freedom for é-tests are those in s”. 


Example. The average effect of the sulphur has already been tested 
by an /’-test. We will perform the same test by means of a ¢-test applied 
to the treatment means. Since the control has 8 replications, and the 
mean of all sulphur dressings has 24 replications, 


sa = V (44.9)(% + ate) = 2.736 


The mean scab index for the control is 22.62, and that for the dressings 


is 13.33. Hence 
22.62 — 13.38 


= ——___—_—_ = 3.395 
2.736 
with 25 df. The value of @ is 11.53, in agreement with the F found in 


the analysis of variance. ; 
Tf the true error variances per unit are considered to be different for 
the two treatments, the appropriate formula is 
Bee Roy 
sa = =, ae 
rT rT 


where 1, 9” are the respective error m.s. per unit for the two treatments. 
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In this case the ratio of the treatment difference to sg does not follow 
Students ¢-distribution except in special instances. The development of 
correct significance levels for this case presents a problem that has stim- 
ulated much discussion in recent years. Various methods that have been 
suggested agree closely in practical application except when the numbers 
of degrees of freedom in s,” and s,” are small. Fisher and Yates (4.3, 
tables V1 and V2) give tables of the significance levels of d/sa derived 
from Fisher’s theory of fiducial probability. (Note that their s;? cor- 
responds to our s;7/r;, etc.) Where these tables are not readily ac- 
cessible, the following approximation is suggested. It probably errs 
slightly on the conservative side, in the sense that the value of t required 
for significance may be slightly too high. 

Let 71, nz be the numbers of degrees of freedom in s;”, s2”, respectively. 
From the ordinary ¢-table record the significance levels t,, tg correspond- 
ing to m; and nz degrees of freedom, respectively. The approximate sig- 
nificance level for the ratio d/sg is 


wit, + wot 
Wy + We 


U 


where 


Since ?’ always lies between the ordinary ¢ values for ny and nz degrees 
of freedom, this calculation is needed only for those occasional cases 
where d is close to the borderline of significance. Further, when ny = 
n2 =n, U’ is the ordinary ¢ value for n degrees of freedom. 

Example. This experiment is a case where one might suspect that the 
error variance would not be homogeneous. If a treatment is very suc- 
cessful in reducing scab, the scab indices for that treatment will all be 
close to zero; hence their variance must be small. With a treatment 
such as the control the scab index has a greater possible range of vari- 
ation. Consequently, we might expect the error variance for a treatment 
to depend on the mean produced by the treatment. With only 3 df. 
for the estimated error variance per treatment, we cannot hope to detect 
small differences in the true error variances. Perhaps the most important 
comparison is that between the variance for the control and the average 
variance for the sulphur dressings. The error m.s. will be found to be 
70.0 for the control (with 7 d.f.) and 35.2 for the dressings (with 6 X 3 
or 18 d.f.). The F-ratio, 1.99, falls just short of the 10% level of sig- 
nificance. 

Consequently, the conclusions will be on a sounder basis if we ascribe 
different true error variances to the control and the dressings. For a 
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t-test of the average effect of sulphur we now take 


70.0 35.2 
24 = a V°8.75 + 1.47 = 3.197 


The value of Lis 9.29/3.197, or 2.906. To test this at the 1% level by the 
approximate test above we have 


my = 7} my = 18; th = 3.499; & = 2.878; wm = 8.75; wo = 1.47 


Hence the value required for significance is ns 


yr = BT5)B.A99) + (1.47) (2.878) 
¥ 10.22 


= 3.410 


The average effect of sulphur now falls short of the 1% significance level. 
It remains significant at the 5% level. For comparisons among the dif- 
ferent sulphur dressings we would use ordinary ¢-tests, where s* is taken 
as 35.2 with 18 d.f. The effect will be to enhance the significance of 
comparisons among the dressings, though the conclusions quoted pre- 
viously are not altered. 

Ags mentioned in section 3.9, it is usually better to handle this type of 
experiment, where the error variance is thought to be a function of the 
mean, by converting the original data to a scale on which the error vari- 


TABLE 4.3 ANALYSIS OF VARIANCE OF THE SQUARE ROOTS OF THE SCAT INDICES 


Source of variation df. 8.8. m.8. Pr 
‘Treatments 6 18.22 8.04 4.19 
Control vs. sulphur 1 8.17 8.17 11.27 
Fall vs. spring application 1 4.59 4.59 33 
Comparisons among levels 4 5.46 1.36 1,88 
Error 25 18.12 0.725 
Error for controls 7 6.27 0,896 
Error for dressings 18 11.85 0.658 
Total 31 36.34 


ances will be more nearly constant. On the supposition that the error 
variance may be proportional to the mean, the square roots of the scab 
indices might have been analyzed. As a matter of interest this analysis 
is shown in table 4.3; square roots were recorded to only one decimal 
place. 

The error m.s. for the controls is now closer to that for the dressings, 
the F-ratio being 1.36 against 1.99 in the original analysis. Further, the 
F-ratios for most components of the treatments s.s. are higher than in the 


original analysis, suggesting an increase in the sensitivity of the analysis. 


—— So 
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4.2. Single Grouping: Randomized Blocks 


4,21 Description. The essence of this design is that the experimental 
material is divided into groups, each of which constitutes a single trial 
or replication. At all stages of the experiment the object is to keep the 
experimental errors within each group as small as is practicable. Thus, 
when the units are assigned to the successive groups, all units which go 
in the same group should be closely comparable. Similarly, during the 
cotirse of the experiment, a uniform technique should be employed for 
all units in the same group. Any changes in technique or in other con- 
ditions that may affect the results should be made between groups. 

This division into replications need be recognized only at those stages 
in the conduct of the experiment where the division may help to reduce 
experimental errors. In agricultural field experiments the division is 
made at the start when the plots are marked out in the field. Since 
neighboring plots are known to be more alike in fertility than plots some 
distance removed, each replicate consists of a compact group of plots 
and is made approximately square in shape if feasible. Cultivations 
designed to keep the land clean of weeds will usually be carried out with- 
out regard to the replications, because it is not believed that results are 
affected by the order in which plots are cultivated. Similarly, the plots 
will generally be harvested in whatever order is most convenient. ibe 
however, harvesting must be spread over a number of days, it is well to 
harvest the plots replication by replication, in case rainfall or other 
factors should produce changes in the weight of the crop from day to 
day, In other types of experimentation no real distinction might be 
made between replicates until relatively late in the experiment. 

The principal advantages of randomized blocks are as follows. 

1, By means of the grouping, more accurate results are usually ob- 
tained than with completely randomized designs. 

2, Any number of treatments and any number of replicates may be 
included. With the design as described above, each treatment will have 
the same number of replicates. If extra replication is desired for some 
treatments, each of these may be applied to two units within every 
group. This device provides twice the standard number of replicates 
for the treatments in question, at the expense of some increase in the size 
of the group, which now contains more than a single replication. Sim- 
jlarly a treatment may be applied three or four times in a group. 

3. The statistical analysis is straightforward. Mishaps which neces- 
sitate the omission of a complete group or of the entire data from one or 
more treatments do not introduce any complication in the analysis. 
When data from some individual units are lacking, the “missing-plot” 
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technique developed by Yates enables the available results to be fully 
utilized. Some extra computational labor is, however, involved, and if 
the gaps are numerous the design is less convenient in this respect than 
complete randomization. 

4, If the experimental error variance is larger for some treatments 
than for others, an unbiased error for testing any specific combination 
of the treatment means can still be obtained. 

No design is more frequently used than randomized blocks. Certainly 
if a satisfactory degree of precision is reached, there is little need to search 
for alternative designs. 

It is worth noting that the replication means provide unbiased com- 
parisons of the differences among replicates. Occasionally, these dif- 
ferences measure some property of the experimental material that is of 
interest. The variance-ratio test of the replications against the error 
m.s. requires some care in its interpretation. In a greenhouse experi- 
ment, for instance, a different type of soil might be used in each replica- 
tion. Significant differences among replications might be due either to 
differences in soil type or to differences in the positions of the replications 
within the greenhouse. 


4.22 Randomization. When the units have been grouped, the treat- 
ments are assigned at random to the units within each group. A new 
randomization is made for every group. Unless the number of treat- 


ments exceeds 16, tables 15.6 and 15.7 are convenient for this operation. 


4,23 Statistical Analysis. The example comes from an experiment 
carried out by the North Carolina Agricultural Experiment Station at 
Rocky Mount, N. C., in 1944. ‘The experiment tested the effects of 5 
levels of application of potash, supplying respectively 36, 54, 72, 108, 
and 144 Ib. KeO per acre, on the yield and properties of cotton. The 
measure chosen for’ analysis is the Pressley strength index. This is 
found by measuring the breaking strength of a bundle of fibers of a given 
cross-sectional area. A single sample of cotton was taken from each plot, 
and 4 determinations were made on each sample, The figures in table 
4.4 are the means of these 4 samples.* The experiment was arranged 
in 3 randomized blocks of 5 plots each. 

To make the computing instructions more general we suppose that 
there are ¢ treatments and r replicates, and that y denotes a typical ob- 


servation. 


* Since the machine which measures the index is calibrated in arbitrary units, no 
dimensions are ascribed to the data in table 4.4. The index can be converted ap- 
proximately into pounds per square inch by means of a regression formula, 
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Step 1. Find the treatment totals (7), the replicate totals (R;), and 
the grand total (@). 


TABLE 4.4 SrreNGrH INDEX OF COTTON IN A RANDOMIZED BLOCKS EXPERIMENT 


Replications 
Treatments (Sao A Coded 
Pounds K;O0 per acre 1 2 3 Totals dressing 
36 7.62 8.00 7.93 23.55 13 
54 8.14 8.15 7.87 24.16 8 
72 7.76 7.73 7.74 23.23 3 
108 TAT «7.57 «7.80 22.54 —7 
144 7.46 7.68 7.21 22.385 —17 
Totals 38.15 39.13 38.55 115.83 
Analysis of variance 
Source of variation af. 8.8. ms. F 
Replications r(-1) = 2 0.0971 
Treatments @-)) = 4 0.7324 0.1831 4.19* 
Error (—1)t-1)= 8 0.3495 0.0437 
Total re—-1 = 14 1.1790 


Subdivision of the treatments s.s. 


df. ss. m.s- F 
Linear response 1 0.5663 0.5663 12.96** 
Deviations 3 0.1661 0.0554 1.27 


Slep 2. The sums of squares are obtained as follows. 
@ (115.83)? 


Correction factor: C = fate TB = 894.4393. 
Total: >> y? — C = (7.62)? + (8.14)? +-+++ (7.21)? ~ 894.4393 = 1.1790 
R? 15)? 9.13)? 8.55)” 
Replications: >> . C= (eee * BOS DOEES 894.4393 = 0.0971 
Treatments: 
Ti? 23,55)? + (24.16)? +--++ + (22.35)? 
= 2g - Bet 18) +-+++ (22.35) 5944808 = 0.7324 


r 3 
Error: (total s.s.) — (replications s.s.) — (treatments s.s.) 


= 1.1790 ~ 0.0971 — 0.7824 = 0.3495 


The /-ratio for treatments (table 4.4) is 0.1831/0.0437, or 4.19, which 
js significant at the 5% leyel with 4 and 8 df. The treatment totals 
suggest that the strength decreases with increasing applications of 
potash, though there is a hint of a maximum in strength for the 54b. 
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application. Accordingly it is worth while to examine the shape of the 
response curve. We will first fit a linear regression on the amount of 
dressing. Note that successive increments of dressing are not equal, the 
last two increments being twice as large as the first two. This is a 
fairly common practice in cases where the effectiveness of an increase in 
dressing is expected to be smaller at the higher levels of application. 

Although the regression can be calculated by the standard formula, 
a simple coding of the dressings lightens the work. If we place the - 
lowest dressing (36 Ib.) at zero and take 18 Ib. as 1 unit on the scale, the 
dressings may be coded as 0, 1, 2, 4, and 6, respectively. A further de- 
vice is to subtract a common amount from these values so that their 
mean is zero. Since the mean of the coded dressings is 134, this is the 
amount that must be subtracted. To avoid fractions, we multiply all 
values by 5 and subtract 13 from each product. The resulting coded 
dressings are shown in table 4.4; the signs have been changed so that the 
regression coefficient will be positive. 

The regression coefficient is now obtained as 


(13) (23.55) -+ (8)(24.16) + (8)(23.23) — (7)(22.54) — (17) (22.85) 
(3)[(43)? + (8)? + 3)? + 7)? + 077] 
31.3 


= —— = 0.0180 
174 
The factor (3) in the denominator is inserted to convert the treatment 
totals to means. The regression coefficient represents the average de- 
crease in strength for a unit increase on the coded scale. Since 5 units 
on the coded scale correspond to 18 Ib. K20, the decrease in strength for 
each additional 18 Ib. K2O is estimated as 0.090. 
The contribution of the regression to the sum of squares in the analysis 
of variance is (by rule 1, section 3.42) 


(31.39)? 
1740 


as shown in the analysis of variance. The contribution from the linear 
regression is significant at the 1% level. Since the mean square for 


deviations from the regression, 0.0554, is only slightly above the error 


m.s., there seems no point in investigating a quadratic regression, though 


the reader may care to verify that its contribution is small. 

The conclusion from the data is that increased dressings of potash 
produce a weaker fiber, the strength index declining by 0.090 for each 
18 Ib.-increment in K,0. As an exercise we will obtain confidence limits 
for this rate of decline. We first require the estimated standard error for 


o1s 


= 0.5663 
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the 0,090 figure. This can be found by expressing the figure as a linear 
function of the treatment means 9; and applying rule 5a of section 3.51. 
We have 

0.090 = s8pLl3g1 + 872 + 393 — 794 — 1795] 
so that the estimated standard error is 


V 0.0437 5 5 5 5 
Fi = SNORT ame + (8)? +++-+ (17) 


5 V 0.0437 

~~ 4/1740 

Alternatively, this value could be derived by noting that the square root 

of the F-value for the linear regression, i.e., V 12.96, or 3.6, is the ratio 

of 0.090 to its estimated standard error. Thus the latter must be 

0.09/3.6, or 0.0250. Finally, for the 80% confidence limits, we have 

(0.090 + 0.025 X 1.397), or (0.090 + 0.035), where 1.397 is the value of 
t for a probability 0.20 and 8 df. 


= 0.0251 


4.24 Standard Errors. The estimated standard error of the difference 
between two treatment means is 


2s? 
sa = = 
r 


If some treatments receive extra replication, the general formula is 


ee (- ~) 
a s =F 
is Te 


In experiments where the error appears to be heterogeneous, a separate 
error may be obtained for any pair or for any group of treatments. For 
this a new randomized blocks analysis is carried out on those treatments 
which belong to the group under consideration. 


4.25 Missing Data. The method of analysis when part of the data is 
missing is described by Yates (4.4). With a single missing unit, the first 
step is to calculate a value for the unit by means of the formula 
Dict kG 
(rr — )é- 1) 
where B is the total of the remaining units in the block where the missing 


unit appears, 7 is the total of the yields of this treatment in the other 
blocks, and G is the grand total; r and ¢ are the numbers of replicates and 


y (4.1) 
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treatments respectively. The analysis of variance is then carried out as 
usual except that 1 df. is subtracted from the total s.s. and from the 
error 8.8. 

The standard error of the difference between the mean of the treat- 
ment with a missing value and the mean of any other treatment is 


of? 
' [; a rir — I(t — | oe 


When there are several missing values, for units a, b, ¢, d,- ++, we first 
guess values by inspection for all units except a. Formula (4.1) is then 
used to find an approximation for a. With this approximation and the 
values previously assumed for ¢, d,-++, we again use formula (4.1) to in- 
sert an approximation for b. After a complete cycle of these operations, 
a second approximation is found for a and so on until the new approxi- 
mations are not materially different from those found previously. The 
analysis of variance is then completed; for each missing unit, 1 d.f. is 
subtracted from the total and error s.s. 

Suppose that in the previous example two observations had been 
missing and that the data had appeared as follows. 


Replications 

Pounds K,0 1 2 3 Totals 
36 a 8.00 7.93 15.93 

54 8.14 8.15 7.87 24.16 

72 7.76 b 7.74 15.50 

108 TAT 7.57 7.80 22.54 

144 7.46 7.68 7.21 22.35 
Totals 30.53 31.40 38.55 100.48 


Since differences between replications are not pronounced, we might, 
take as a trial value for a the mean of 8.00 and 7.93, or 7.96. For 
estimating bi, the first trial value of b, by formula (4.1) we now have 

B = 31.40; T = 15.50; G = 100.48 + 7.96 = 108.44 


so that 
(3) (31.40) + (5)(15.50) — 108.44 


ya 8 7.91 
For estimating a2, the second trial value of a, we now have 
B = 30.53; T =.15.93; G = 100.48 + 7.91 = 108.39 
so that ; 
= (3) (80.53) + (5)(15.93) — 108.39 sehen kG 


ag 8 
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Taking ay as 7.86, we find that the next trial value of b is 7.92. This is so 
close to the previous value that we may stop. Thus a = 7.86, b = 7.92. 

The analysis of variance is now computed with these values inserted. 
There will be 12 d.f. in the total s.s., and 6 df. in the error s.s. The error 
m.s. is 0.0491. 

To obtain a standard error for the comparison of two treatments, A 
and B, we assign an “effective” number of replicates to each treatment. 
Any replication of treatment A is counted as 1 when both A and B are 
present in the replicate, as 14 when A is present but B is not, and as 0 
when A is missing. The same rule is applied in scoring B. Suppose that 
in the example we are comparing the means of the 36- and 72-lb. dress- 
ings. The effective replication for each mean is scored as 1144. The 
standard error of their difference is taken as V s2(24 + 24), or V1.3388?. 
If we are comparing the mean of the 36-lb. dressing-with that of the 54- 
Ib. dressing, which has no missing values, the score for the first mean is 
2, and that for the second 214, so that the standard error of their dif- 
ference is V/s2(44 + 34), or V0.98". 

This useful approximation is due to Yates; the exact formulae are 
laborious to compute. The approximation usually gives a standard 
error that is slightly too high. In the example the correct values for the 
standard errors of the two differences considered above are 1.069s and 
0.937s, as compared with the approximate values of 1.155s and 0.949s, 
respectively. 

As a result of the disturbance introduced by the missing units, the 
treatments m.s. is slightly too large; however, the variance-ratio test is 
unlikely to be much in error unless a substantial proportion of the units 
are missing. Yates (4.4) gives the method for obtaining an exact test. 


4,26 Estimation of Efficiency. If H, and £, are the block and error 
m.s. and np, m, and n, the block, treatment, and error degrees of freedom, 
Fie a ee ae ee (4.3) 
Ny + Nt + Ne 
is an estimate of the error variance of a completely randomized design 
with the same experimental material. Comparison of £,,, and Ee, 
taking account of the change in numbers of error degrees of freedom 
(section 2.31), provides an estimate of the increase in accuracy which 
results from the grouping into replicates. 

The result in (4.3) may be proved in various ways. Since there is no 
complete discussion of results of this type in the literature, one method 
of proof will be sketched; the details, which are a matter of algebraic 
manipulation, will not be given completely. ‘This proof uses the proper- 


4,26 ESTIMATION OF EFFICIENCY 101 


ties of randomization. It is not the easiest proof, but requires very few 
assumptions. 

Let the experiment have ¢ treatments and r replications, and let ¢; . 
be the experimental error of the observation on the ith unit, and 7; the 
effect of the jth treatment. No assumption is made about the nature 
or distribution of the errors. For this reason it is not necessary to 
introduce any specific symbol for the effect of the replication, since 
any type of effect can be represented by appropriate choice of the e’s. 
The treatment effect and the error are assumed to be additive; that is, 
if the randomization happens to put the jth treatment on the ith unit, 
their joint effect is (7; + ¢;). Throughout the randomization the e; are 
regarded as a set of fixed numbers, each associated with a specific unit, 

Without loss of generality, we may assume that the total of all ob- 
servations is zero, and that the totals of the e’s and the 7’s are both zero. 


Let rt t 
S= dYe?; T=>77 
t=1 j=1 
Further, for all possible randomizations of the randomized blocks design 
the replications will of course remain the same; consequently, for any 
given batch of data the replication totals remain unchanged. Let R de- 
note the sum of squares of these totals, divided by t. 

The main part of the proof consists in working out the average values 
of the mean squares in the analysis of variance, taken over all possible 
randomizations of each type of design. The results come out as shown 
in table 4.5. 


TABLE 4.5 AVERAGH VALUES OF MEAN SQUARES TAKEN OVER THE 
RANDOMIZATION SETS 


Completely randomized 


df. ms. 
pe Ss 
Treatments (-—) a) at =a 
Error u(r — 1) a5 
Randomized blocks 
df. m.s. 
sf R 
Replications (r — 1) Ga) 
; rT 8 R 
Treatments ¢@-—1) tp a0 —D Sean 
Ss R 
Error ( — 1) — 1) 


ré—-1) 7-1) 
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This analysis leads at once to the result. Since 
ny = (r — 1); m+ ne = rt — 1); 


jt is seen from table 4.5 that for randomized blocks the average value 
of npBp is R, while that of (nm: + n-)E. is (S — R). It follows that the 
average value of 


mE t+(m+n)Be R+S-R_ 8 
my + me + me =) G1) 


But this is the average of the error m.s. for a completely randomized 
design on the same data. 

As an illustration of the details, consider the treatments m.s. for ran- 
domized blocks, which is probably the hardest term. The square of the 
first treatment total is 


Ty? = (rr + er + ea tess + Gy)? 


When we average over all possible randomizations, there is no contribu- 
tion from terms in re;, since the treatment appears equally often on all 
units, and the total of the e’s is zero. The average value of the contri- 
bution from the r terms in e” is the sum of the squares of all the e’s, 
divided by ¢, or S/t, since the treatment appears on any specific unit 
in a fraction 1/¢ of all randomizations. 

The contribution from terms in e;e; may be written 


e1(é2 + ¢3 +--+ er) + ea(er + €3 e+ >t er) >> 

+ @r(e1 + 2 +++++ e-1) (4.4) 
The randomization is restricted so that every e comes from a different 
replication. If the subscript denotes the replicate, the mean value of 


e,e; is R:R;/t, where R; is the replication total of the e’s. Consequently, 
the mean value of (4.4) is 


1 
a Baa Bis home Re) e Ral Ha +---+R,)+-:- 


1 R 
+ RR + Re ++-++ Ra) = — ph? + Ral +--+ Be?) ay 
since the total of the R; is zero, and since R will be recognized as the sum 
of squares of the R;, divided by t. Hence 
fia 
(D2) = Pn? += = — 
t t 
If this expression is summed for all ¢ treatments and divided by r(¢ — 1) 
to give the treatments m.s., the result in table 4.5 follows. 
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4.3 Double Grouping: Latin Squares 


4.31 Description. In the latin square the treatments are grouped into 
replicates in two different ways. Examples of latin squares are shown 
for different numbers of treatments in plan 4.1 (p. 119). It will be noted 
that every row and every column of any square is a complete replication. 
The effect of the double grouping is to eliminate from the errors all dif- 
ferences among rows and equally all differences among columns. Thus 
the latin square provides more opportunity than randomized blocks for 
the reduction of errors by skillful planning. 

The experimental material should be arranged and the experiment 
conducted so that the differences among rows and columns represent 
major sources of variation. Some examples of the uses of latin squares 
in various fields of research may indicate the utility of the design. In 
field expériments the plots are usually laid out in a square formation, 
so that soil fertility and other variations in two directions are controlled. 
Variations along and across the greenhouse bench may be similarly 
handled in greenhouse experiments. Occasionally the latin square is 
advantageous even when the plots form a continuous line. In this case 
the rows may be compact blocks of land while the columns specify the 
order within each block. If the yield gradient is suspected to be in the 
same direction all along the line, blocks and order in blocks together re- 
move the effects of the gradient more thoroughly than a single control. 
An example of this type is shown below. 


TABLE 4.6 A LATIN SQUARE WITH THE PLOTS IN ONE CONTINUOUS LINE 


Seven treatments A, B, C, D, EF, F, G 
(Experimental limitations force the plot units to go at right angles to gradient.) 
Rep. I 


aig m Iv A VI vil 
HAP peepee dle Hie 

Main and Tippett (4.5) describe how 4 X 4 latin squares were em- 
ployed in experiments on the weaving of cotton cloth. The purpose of a 
series of experiments was to investigate the effect of the sizing treatment 
applied to the warp. The criterion was the number of breaks in the warp 
during weaving. Four warps, each with a different sizing treatment, 
were woven simultaneously on 4 different looms, which could be super- 
vised by a single weaver. Then each warp was moved to a different loom 
of the set so that after 4 periods every sizing treatment had been tested 


on all 4 looms. 


F 


° 


ES 
EI 
S 
S) 
> 
EI 
= 
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If A, B, C, D represent the 4 warps, the latin square used was as 
follows. 


Looms 
Periods 1 2 3 4 
I A D B Cc 
II D Cj A B 
Ill Cc B D A 
Iv B A (6 D 


This arrangement eliminates constant differences among the looms, 
which were found to be large, and also differences among the 4 periods 
of weaving. No “period” differences were apparent—a result which 
might be anticipated, the authors suggest, because humidity and warp 
tension were controlled throughout the experiment. 

Frequently, particularly in industry, an experiment requires a series 
of operations each of which may introduce variability into the final re- 
sults. In such cases the latin square may be useful in a preliminary in- 
vestigation of the sources of variation. For example, in the preparation 
of an, explosive mixture used in primers, variation may occur either in 
the mixing of the ingredients of the explosive or in the process of charg- 
ing. One experiment of this type involved 4 mixing-blending teams and 
4 charging operators. On each day, the product of each team was sent 
to a different charging operator, the arrangement being changed daily 
according to the following 4 X 4 latin square (letters W, X, Y, and Z 
represent the mixing-blending teams). 


Charging operators 


1 2 3 4 
Monday Ww Z x Y 
Tuesday Xx Ww v6 Z 
Wednesday Y x Z Ww 
Thursday Z Y Ww xX 


The latin square analysis of variance enables us to isolate consistent dif- 
ferences amongst the teams and consistent differences amongst the 
chargers, as well as day-to-day variations. 

To quote a more complex example, a 12 X 12 latin square was used 
by Chen, Bliss, and Robbins (4.6) to estimate the toxicities of 12 poisons 
when applied to cats. A single replication of this experiment occupied 
a complete day, and the experiment was repeated for 12 days. Each 
drug was injected into the femoral vein of a cat at 1 ce. per minute until 
the cat died, and the measurement recorded was the dose required to 
cause death. An observer could administer only two of the drugs at a 


sae 


ee — 
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given time. Consequently, three observers were used, each treating 
two cats in the morning and two in the afternoon. The rows of the latin 
square were used to eliminate systematic differences between observers 
and between the morning and afternoon injections. The columns repre- 
sent days. The experimental plan was as follows (letters represent 


drugs). 


Day 
Time Observer 

1/2/3/4/]5|6)]7]81]9 | 10/11)12 

10:30 a.m. ut POR |e RNa | ae 2) | a ie 
Re Gli spe 1B De] REN VD a 

Il js fad ES (oC Jr 2. VE Va ek aU i Td hy 
E\|D|F\@\|s |K\/|A4)/L\C)r |B) a 

It Coe ARB Ome fe EE | sD Go ER Pe 

PEK Ie) a | C | DBA Lyd ie 

2:30 p.m. I CR NA NE | eB NG es 
D|FI|I|AI/LI|£E|C|\|G@G\J/B/H|K 

IL SEBO ACC | RO Ges PRN oe 
BBE Ve eC A) Bey BO) Re Gs aie 

It Ge NE | A Oe 
L\|A\H|T|B\D\EB|F |) F |) kK) CG 


During the course of the experiment every drug was given equally 
often by each observer and equally often in the morning and afternoon. 
The arrangement is a good example of the way in which a single grouping 
(rows) can control simultaneously more than one potential source of 
variation. All three major variables—day, time of day, and observer— 
would have inflated the experimental error if left uncontrolled. 


4.32 Number of Replications. The chief restriction on the utility of 
the latin square is that the number of replicates equals the number of 
treatments; if the latter is considerable, the number of replications re- 
quired becomes impractical. Squares larger than 12 X 12 are seldom 
used, while the most common range is from the 5 X 5 to the 8 X 8 
square. Latin squares also suffer to some extent from the same dis- 
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advantage as randomized blocks in that the experimental error per unit 
is likely to increase with the size of square. 

The small squares provide only a few degrees of freedom for the 
estimation of error—none with the 2 X 2, two with the 3 X 3, and six 
with the 4 x 4. This fact precludes the use of single 2 X 2 squares, 
while the 3 X 3 and 4 X 4 squares must produce a substantial reduction 
in error over randomized blocks or complete randomization to counter- 
balance the loss of degrees of freedom. More than one square may, 
however, be included in the same experiment. Three 3 X 3 squares 
(9 replicates) furnish 10 error d.f., while two 4 X 4 squares give 15 df, 
provided that in each case the “squares X treatments” interaction can 
be pooled with error. A considerable number of 2 X 2 squares would be 
required, since the error degrees of freedom are 1 less than the number of 
squares. 


4,33 Randomization. A complete representation of the squares from 
4 X 4 to 6 X 6 and sample squares up to the 12 X 12 is given by Fisher 
and Yates (4.3). For squares up to the 6 X 6, the randomization pro- 
cedure given in this reference selects (with a minimum of labor) a square 
at random from all latin squares of a given size. For a discussion of the 
theoretical basis of the randomization, see Yates (4.7). 

Examples of latin squares are shown in plan 4.1 (p. 119). The method 
of randomization for plan 4.1 is as follows. 

3X8. Arrange the columns at random and the last 2 rows at random. 

“4X4. Select at random one of the 4 squares. Arrange at random all 
columns and the last 3 rows. It is equally good, though not strictly 
necessary, to randomize all rows and columns. 

5 X 6 and higher squares. Arrange all rows, columns, and treatments 
independently at random, 

For 3 X 3 and 4 X 4 squares, this procedure selects one square at 
random from all possible squares. For 5 X 5 and larger squares, some 
types of squares have no chance of being selected if plan 4.1 is used. 
Unless latin squares are used very frequently, however, the randomiza- 
tion sets are sufficiently large for experimental plans. 


4.34 Statistical Analysis. Several experiments have demonstrated 
that people find it difficult to select, by personal judgment, unbiased 
samples even from relatively small populations that can be thoroughly 
inspected before selection. In this experiment, each population con- 
sisted of a small area of wheat, containing about 80 shoots, the shoots 
being slightly over 2 feet high. There were 12 samplers, all experienced 
in studying the growth of wheat. Hach sampler inspected each area and 
measured the heights of 8 shoots whose heights were to give a represent- 
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ative sample of the shoot heights in the area. The quantity that will be 
analyzed is the difference between the mean height of the 8 selected 
shoots and the true mean height in the corresponding area, in other 
words the sampler’s error. 

The samplers were divided into two groups of six, of which we will 
consider only the first group. The samplers represent the experimental 
treatments. There were 6 areas, each sampled by all 6 samplers in the 
group. These areas, which serve as the replications, may be taken as the 
columns of the 6 X 6 latin square. The rows of the square prescribed 
the order in which each man sampled the 6 areas (see table 4.7). This 


TABLE 4.7 SAMPLER’S ERRORS IN SHOOT HEIGHTS (CENTIMETERS) 
6 X 6 LATIN SQUARE 


Areas 
Order Totals 
a 2 3 4 5 6 
I F+ 3.5 | B+ 4.2 | A+6.7 | D+6.6 C+4.1 | £+3.8 +28.9 
I B+ 89| F+ 1.9 | D+5.8 | A+4.5 E+2.4 | C+5.8 29.3 


Tit C+ 9.6 | E+ 3.7 | F—2.7 | B+3.7 | D+6.0 | A+7.0 +27.3 
IV D+10.5 | C+10.2 | B+4.6 | £+3.7 | A+5.1 F+3.8 +387.9 
Vv E+ 31| A+ 72 | C+4.0 | F—3.3 | B+3.5 D+5.0 +19.5 
VI A+ 5.9] D+ 7.6 | E—-0.7 | C+3.0 | F+4.0 B+8.6 +-28.4 


Totals +41.5 434.8 +17.7 +182 +251 +3840 +1713 = G4 


Totals for samplers 
A B Cc D E P 
436.4 +33.5 +36.7 441.5 +16.0 +7.2 


feature served two purposes. It helped to prevent the men from getting 
in each other’s way, since no two men worked on the same area at the 
same time, and it permitted an examination of the effects of the order of 
sampling on the errors. 

Since only 3 of the 36 errors in table 4.7 are negative, there is evidence 
of a rather consistent tendency towards overestimation of the shoot 
heights. For simplicity, the observations will be referred to as measures 
of bias, though of course they contain a component due to chance 
fluctuations. There is also an indication, on casual inspection of the 
table, that the bias varies from one sampler to another. This point will 
be tested in the analysis of variance. The computations are as follows. 
First find the totals for each row, column, and latin letter and the grand 
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total, G, shown in table 4.7. The following sums of squares are then 
calculated. 


@ (171.3) 


Correction factor: C = Seer = 815.10 

Total: (8.5)? + (4.2)? +-+-+ (8.6)? — C = 1144.73 — 815.10 = 329,63 
Rows: $[(28.9)? +--+-+ (28.4)?] — C = 843.70 — 815.10 = 28.60 
Columns: $[(41.5)? ++ +++ (34.0)?] — C = 803.97 — 815.10 = 78.87 
Treatments: ${(36.4)? ++ +++ (7.2)?] — C = 970.70 — 815.10 = 155.60 


The divisor 6 is replaced by r for an r X r square. 
These sums of squares are entered in the analysis of variance. The 
error s.s. is found by subtraction. 


. 


TABLE 4.8 ANALYSIS OF VARIANCE FOR A 6 X 6 LATIN SQUARE 


Source of variation Gb, 8.8. m.s. F 
Rows (order of sampling) (r — 1) = 5 28.60 5.720 1.72 
Columns (areas) (r —1) = 5 78.87 15.774 4,74 ** 
Treatments (samplers) ~ — (r — 1) = 5 155.60 31.120 9.35 ** 
Error (r — 1)\(r — 2) = 20 66.56 3.328 

Total (?-1) =85 329.63 


To obtain the mean squares, each sum of squares is divided by the 
corresponding number of degrees of freedom, The F value for samplers 
is 31.120/3.328, or 9.35, well beyond the 1% level, which is 4.10 for 
5 and 20 df. This shows that the extent of the overestimation varies 
from one sampler to another. The means for the samplers are given 
below. 


A B Cc D E F 
+6.07 +5.58 +6.12 +6.92 +2.67 +1.20 


The estimated standard error of each mean is V s?/r, where s* is the 
error m.s., or in this case V (3.328)/6 = 0.745. For testing the dif- 
ference between a pair of means, the standard error is 1/2 (0.745) = 
1.053. Since the 5% t value for 20 d.f. is 2.086, the difference between 
two means must be at least (2.086)(1.053), or 2.20, in order to attain 
significance at this level. It appears that the samplers fall into two sets— 
A, B, C, D and E, F, the biases being significantly smaller for Z and F 
than for the others. 

Some further information about the nature of the biases can be ob- 
tained from an examination of the row and column totals. From table 
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4.8 the F value for rows is seen to be 1.72, which is below the 5% value, 
2.71. No clear effect of the order in which the areas were sampled has 
therefore been established; further, the row totals (table 4.7) do not sug- 
gest any consistent trend. 

On the other hand, the F value for the columns m.s., 4.74, is significant 
at the 1% level, so that the bias varies from area to area. Previous 
observations had indicated that the bias might depend on the mean shoot 
height of the area. The column totals are ranked below in order of in- 
creasing true mean shoot height. 


‘True mean Column 
shoot height total 
Area (centimeters) (centimeters) 

1 59.0 +41.5 

2 66.2 +34.8 

6 72.3 +34.0 

4 74.5 +18.2 

5 76.0 +25.1 

3 76.4 +17.7 


Apparently on areas where the shoots are not so high the positive bias 
is greater than on areas with relatively high shoots. The result is not 
surprising, since the sampler had to stoop in order to do his work. The 
linear regression of the column totals on the true mean heights contrib- 
utes 59.88 to the sum of squares for columns, 78.87. The mean square 
for deviations from this regression (4 d.f.) is only 4.75, which is not sig- 
nificantly greater than the error m.s. The inference is that the large 
value of the mean square for columns is due mainly to the negative 
correlation between the bias and the mean height. For further discussion 
of this experiment, see (4.8). 

The analysis of two 4 X 4 squares is discussed in (4.9). A numerical 
example with five 3 X 3 squares will be found in (4.10). It should be 
noted that with several latin squares the interaction of treatments with 
squares may be tested. If there is no reason to expect that this inter- 
action is real, the corresponding sum of squares may be combined with 


error. 
os? 
4,35 StandardErrors. The formula f i gives the estimated standard 


error of the difference between two treatment means, where s” is the 
mean square per unit and 7 the number of replicates. In cases where 
heterogeneity of errors is suspected, the error cannot be subdivided with 
the same ease as in randomized blocks. It is possible to remove from the 
error 8.8. the contribution of any one of the treatments, Yates (4.11). For 
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each yield y; of the treatment, calculate the quantity 
d;=R: + Ci — 1: 


where R; and C; are the corresponding row and column totals, and r is 
the number of treatments. The sum of squares of deviations of the 
quantities d;, divided by r(r — 2), is the contribution of this treatment 
to the error s.s. and has (r — 1) degrees of freedom. For illustration, 
suppose that we wish to calculate the contribution of sampler /’ to the 
error in section 4.34. From table 4.7 the first d value is 


d, = (+28.9) + (+41.5) — 6(+3.5) = +49.4 


and the remaining values will be found to be +52.7, +61.2, +-57.5, 
+29.5, +49.1, with a total +299.4. The sum of squares of deviations 
from the mean is 610.34. Divided by 24, this gives 25.43, with 5 d.f., 
for the contribution of sampler F to the error s.s. Consequently the 
error 8.8. (table 4.8) may be divided as shown below. 


df. 8.8. m.8. 
Total error 20 66.56 3.328 
Contribution of F 5 25.43 5.086 
Remainder 15 41.13 2.742 


The error m.s. for the suspected treatment may be compared with 
the mean square for the remaining (r — 1)(r — 3) degrees of freedom, 
to test whether the treatment shows abnormally high variation. In 
this example the variance ratio is 1.85, with 5 and 15 df. If it is con- 
cluded that the treatment is unusually variable, the mean square with 
(r — 1) (r — 8) degrees of freedom is used as error for the other treat- 
ments. It should be stressed that the ordinary variance-ratio tables can 
be used in this test only if the decision to make the test, was taken before 
examining the individual results. If the test is made merely because one 
treatment looks anomalous on inspection, different significance levels 
that take account of this fact are needed, as discussed in section 3.53. 


4.36 Missing Values. Except for changes in the formulae, the pro- 
cedure is similar to that for randomized blocks (section 4.25). A single 
missing value is substituted by means of the formula, reference (4.4) 


_rR+C+ 7) —2¢ 
SG =r) 


where R, C, T, and G are respectively the totals of the row, column, and 
treatment which contain the missing value and the grand total. The 


(4.5) 
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estimated standard error of the difference between the corresponding 
treatment mean and the mean of a treatment with no missing values is 


2 1 
ie le 2 Sae 5) Go 


When several values are missing, repeated application is made of for- 
mula (4.5) as described in reference (4.4). When values have been in- 
serted for all missing data, the usual analysis of variance is calculated. 
One degree of freedom is subtracted from the error df. for each missing 
value. 

The exact formula for the standard error of a treatment mean is com- 
plex. Yates (4.4) proposed a useful approximate rule, which shows the 
number of replicates to be assigned to any treatment mean for a com- 
parison with another treatment mean. Each observation on one treat- 
ment is given one replication if the other treatment is present in both 
the corresponding row and column. The replication is 24 when the 
other treatment is missing in the row or column and is 14 when the other 
treatment is missing in both the row and column. When the treatment 
itself is missing, the replication is 0. Although complicated at first sight, 
the rule is not difficult to apply and is reasonably accurate. Consider for 
instance the following 6 X 6 square with 3 units missing (two B’s and 
one ) as indicated by the parentheses (). 


B E Cc F D A 
F D (B) E A Cc 
Cc A E D B Pr 
(E) Cc A (B) F D 
A F D Cc E B 
D B F. A (4 E 


Taking the yields of B and # in the order of the rows, we ascribe the 
following numbers of replicates. 


B: 24+0+14+04+141= 33 = 
E: 1+44+3+04+1+1=4 
Hence the standard error of the difference between the treatment means 


is V8? G3 +4). For the difference between A and B, the reader may 


verify that the standard error is V2; +). Notice that the ef- 
fective replication for a treatment may change from one comparison to 
another. 

The analysis required when a single row, column, or treatment is miss- 
ing is described by Yates (4.11). When more than one row, column, or 
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treatment is missing, consult Yates and Hale (4.12). These methods 
are also presented by DeLury (4.13). 


4.37 Estimation of Efficiency. The effectiveness of either the row or 
the column grouping may be tested from the results of latin squares. 
The expression 
_ yl, + (ma + Ne) Be 
Ny + Mt + Ne 

is an estimate of the error m.s. which would have been obtained if the 
row grouping had not been used, i.e., if the design had been randomized 
blocks with the columns as blocks. In the formula, Z, and EF, are the 
mean squares for rows and error, respectively, and n,, nz, N» are the num- 
bers of degrees of freedom for rows, treatments, and error. The in- 
creased number of degrees of freedom with randomized blocks is taken 
into account by the method of section 2.31. The effectiveness of the 
column grouping is tested similarly. 

A 5 X 5 experiment on potatoes will serve as an illustration. The 
mean squares for the yields of potatoes were as follows. 


E’ 


df. m.8. 
Rows 4 62.6 
Columns 4 104.7 
Treatments 4 272.0 
Error 12 26.2 


Tf the experiment had been conducted in randomized blocks with the 
columns as blocks, the estimated error m.s. would be 


4(62.6) + 16(26.2) _ 
ee ae 


The degrees of freedom available for error would increase from 12 to 16. 
To take account of the advantage of the additional degrees of freedom 
(section 2.31), we reduce 33.5 by the multiplier (13)(19)/(17)(15), or 
0.969. This gives 32.5 as a comparable mean square for randomized 
blocks. The row grouping increased the information by an estimated 
25%. Of course a considerable number of these comparisons would be 
needed for drawing general conclusions. 


33.5 


44 Cross-over Designs 


4.41 Description. In dairy husbandry and biological assay a design 
has been used which closely resembles the latin square but may have 
some advantages when the number of treatments is small. In the sim- 
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plest case where there are two treatments A and B, the units are first 
grouped into pairs as if a randomized block design were to be used, 
Suppose that from previous knowledge one unit in each pair is expected 
to give a higher response than the other and that the difference in favor 
of the superior unit is expected to be about the same in all pairs. It will 
clearly be advisable to ensure that each treatment is applied to the 
“better” member in half the replicates and to the “poorer” member in 
the other half. The pairs or replicates, which must be even in number, 
are divided at random into two equal sets, the first set to receive treat- 
ment A on the superior member of each pair, the second to receive B. 

The experiment discussed in chapter 1 provides an example. The 
data (shown in table 4.10) are the times required to compute the sums 
of squares of 27 observations on each of two machines whose speeds it was 
desired to compare. Ten sums of squares were calculated, making 10 
replications of the experiment. The cross-over design was used because 
it was thought that the second computation of a sum of squares might 
be faster than the first, so that for a fair comparison each machine should 
be used first in five of the ten replications. The randomization assigned 
machine A to be used first in replications 1, 3, 6, 8, and 9. 

The degrees of freedom in the analysis of variance are as follows. 


df. 

Columns (pairs or replicates) 9 
Rows 2 
Treatments 1 
Error 8 
Total 19 


Note that only 1 d-f. is assigned to rows. That is, the arrangement re- 
moves from error only the average difference between the 2 rows. In so 
far as the difference is not constant but varies from one replication to 
another, this variation enters into the experimental error. 

In some experiments it is known that this difference will vary, and it 
is possible to estimate in advance (at least roughly) whether the dif- 
ference will be large or small for a given replicate. Suppose that the dif- 
ference is expected to decrease steadily from replicate 1 to replicate 10. 
With this knowledge a more accurate design is obtained by the use of 
five 2 X 2 latin squares, as shown below. 


TABLE 4.9 Frve 2 X 2 LATIN SQUARES FOR 2 TREATMENTS IN 10 REPLICATES 


Rows I IL Il Iv Ry 

Pee ae ie 5 6 eS 9 10 
Better A B A B B A A B ANB: 
Poorer B A BOA A B BoA BoA 
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In this arrangement we remove from error 5 d.f. for rows, one in each 
square. Thus not only the average difference is removed, but also the 
variation in the difference from square to square. The combined analysis 
of variance of the 5 squares is set out as follows. 


df 

Squares 4 
Columns within squares 5 
Rows within squares 5 
Treatments 1 
Error 4 
Total 19 


The 9 df. for squares and columns within squares are exactly the same 
as the 9 df. for columns in the cross-over design. Owing to the more 
complete elimination of the row effects, there are only 4 d.f. for error 
instead of 8 with the cross-over plan. 

To summarize, the cross-over design is particularly appropriate when 
the difference between the rows is substantially the same in all replicates, 
for in this case the whole of the real difference between rows is con- 
centrated in the single degree of freedom and the error variance is no 
larger than that of the latin squares. Even if the difference between rows 
is known to be variable, the cross-over design may be preferable in small 
experiments where few degrees of freedom are available for error. 

In dairy husbandry the cross-over may be used to compare the effects 
of two feeding rations on the amount and quality of milk produced by 
the cow. Since cows vary greatly in their milk production, each ration 
is tested on every cow by feeding it during either the first or the second 
half of the period of lactation, so that each cow gives a separate replicate. 
The milk yield of a cow declines sharply from the first to the second half 
of its period, so that the first half is always “‘better,” in the sense above. 
Whether a cross-over is superior to a set of latin squares depends on 
circumstances. The rate of decline is not constant from cow to cow; it is 
greater in general for high-yielding than for low-yielding cows. Thus if 
previous production records are available, the cows may be divided into 
pairs on the basis of yielding ability, each pair being made a separate 
2X 2 latin square. This plan is likely to give a smaller error than the 
cross-over, though sometimes not sufficiently smaller to counterbalance 
the loss of degrees of freedom. 

The design can be used with any number of treatments, subject to the 
restriction that the number of replicates must be a multiple of the num- 
ber of treatments. With three treatments, for example, a plan can be 
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drawn up from the 3 cycles ABC, BCA, CAB, where the order of the 
letters denotes the row to which each treatment is applied. Each cycle is 
allotted at random to one-third of the replicates. For higher numbers of 
treatments a design is constructed in the same way from the columns of 
any latin square. When the number of treatments exceeds four, how- 
ever, the degrees of freedom for error are sufficiently large so that a set 
of latin squares is usually preferable. 


4.42 Statistical Analysis. All sums of squares are calculated in the 

usual way. For the example (table 4.10), the computations are as fol- 

lows. 

(352)? 
20 

Total: (30)? + (14)? + (21)? +--+ (23)? — 6195.2 = 910.8 


Correction factor: = 6195.2 


Columns: 3[(44)? + (42)? +:+++ (47)?] — 6195.2 = 357.8 
= 2 
Rows: Sey = 645 
20 
= 2 
Treatments: (QUEEEBE = 320.0 


20 


TABLE 4.10 Cross-OVER EXPERIMENT FOR COMPARING THE SPEEDS OF TWO 
: COMPUTING MACHINES A AND B 


Time (seconds minus 2 minutes) taken to calculate a sum of squares 


Columns (replications) 
Rows Totals 
1 2 3 4 5 6 7 8 9 | 10 


First (poorer) | A30 | B21 | A22 | B13 | B13 A29 | B 7 | A12] A283 | B24) 194 
Second (bet- 
ter) B14 | A21| B 5| A22| Al8 | B17 | A16 | B14 | B 8 | A23 158 


Totals 44| 42| 27| 35| 31| 46] 23] 26] 31] 47 352 
Treatment totals: A 216; B 136 


Analysis of variance 


Source of variation df. 8.8. ms. 
Columns (replications) 9 357.8 39.8 
Rows (first versus second) 1 64.8 64.8 
Treatments i 320.0 320.0 ** 
Error 8 168.2 21.0 

Total 19 910.8 
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The error s.s. is found by subtraction. The F-ratio for treatments is 
significant at the 1% level. 

We will calculate the confidence limits for the true difference in speed 
which were quoted in section 1.2. The average observed difference is 
8.0 seconds in favor of machine B. The estimated standard error of this 
difference is V 2(21.0)/10, or 2.049, based on 8 d.f. Consequently the 
confidence limits for the true difference are 


8.0 + 2.0490 


where ¢’ is the value of Student’s ¢ corresponding to (1 — P), where P is 
the confidence probability chosen. For P = .95, t/ = 2.306, and the 
limits are (8.0 + 4.7), or 3.3 and 12.7. For P = .80, t’ = 1.397, and, 
for P = .99, t’ = 3.355, giving confidence limits of (5.1, 10.9) and (1.1, 
14.9), respectively, as quoted. 

Worked examples for the case where there are two treatments are also 
given by Brandt (4.14) and Fieller (4.15); the latter also illustrates the 
analysis of covariance for this design and the procedure when one column 
is missing. 

In general, if there are ¢ treatments and r replicates, the degrees of 
freedom subdivide as follows. 


df. 
Columns (r—-) 
Rows (¢ — 1) 
‘Treatments (—1) 
Error ¢ — 1) — 2) 
Total ( — 1) 


2 
4.43 Standard Errors. The usual formula = holds for the standard 


error of the difference between two treatment means. 
The substitution formula for a missing value is 
rC + t(R + T) — 24 
y= (4.7) 
¢ — 1)@ — 2) 
where the capital letters refer respectively to the totals of the column, 
row, and treatment in which the missing value appears and to the grand 
total. The difference between the mean of the affected treatment and 
the mean of a treatment with no missing value has a standard error 


Pi ile, ie ek ON 
: [? Fi ré— I(r — Al x) 


) 
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4.5 Triple Grouping: Graeco-latin Squares 


4.51 Description. In this arrangement the treatments are grouped 
into replicates in three different ways with the consequence that the ef- 
fects of three different sources of variation are equalized for all treat- 


ments. 
TABLE 4.11 A 5 X 5 GRAECO-LATIN SQUARE 


Litters Pen (weight groups) 
a b c d e 
I | Ai Bs C5 Dy Es 
Il | Be Cy Di Ey As 
Il | C3 Ds EB» Ag By 
IV | Ds Ey Ag Bs Ca 
V | Bs Ag Bs Cr Ds 


The additional grouping (usually represented by greek letters) is de- 
noted here by subscripts. Each treatment (A,-++, HZ) appears once in 
each row and column and once with each subscript (1-- +5). 

One example is an arrangement proposed by Dunlop (4.16) for 
testing 5 feeding treatments on pigs. The arrangement requires 5 pigs 
from each of 5 litters (I, II, III, IV, and V), the effects of litter dif- 
ferences being equalized by making litters correspond to the rows of the 
square. The animals are housed in 5 pens (a, b, ¢, d, and e) which con- 
stitute the columns. The columns are used also to control differences in 
weight; i.e., the heaviest animal in each litter is assigned to the first pen, 
the next heaviest to the second pen, and so on; consequently the columns 
eliminate simultaneously the pen differences and the principal variations 
in weight within litters. Thus far the design is an ordinary latin 
square. 

The subscripts signify the positions of the five feeding crates within 
each pen. This extra control, though possibly unnecessary, was sug- 
gested for several reasons. T’ he first and fifth crates in each pen were of 
different construction from the other crates; moreover the pigs occupying 
the end crates have less “company” when feeding. 

The graeco-latin square has not been used often, probably because the 
units can seldom be balanced conveniently in all three groupings. In the 
example discussed above, the practical difficulty was to secure five litters 
of which each contained five or more pigs. An interesting industrial ap- 
plication is described by Tippett (4.17). 
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The designs have been constructed for all numbers of treatments from 
3 to 12, except 6 and 10; examples are shown in plan 4.2 (p. 120). 


4.52 Randomization. Arrange the rows and columns independently 
at random. Assign the latin letters and the subscripts at random to 
their respective classifications. 


4.53 Statistical Analysis. The sums of squares for rows, columns, 
treatments, and subscripts are all obtained in the usual way. For an 
r Xr square the error has (r — 1)(r — 3) degrees of freedom. This 
number is rather inadequate when r is less than 6. 
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Plan 4.1 (Continued) Selected latin squares 
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Plan 4.2 (Continued) Graeco-latin squares 


CHAPTER 5 


FACTORIAL EXPERIMENTS 


5.1 Description 


5.11 A 2? Factorial Experiment. In a factorial experiment the effects 
of a number of different factors are investigated simultaneously. The 
treatments consist of all combinations that can be formed from the dif- 
ferent factors. To illustrate the simplest case, consider an experiment: 
on sugar beet with 2 factors. These were nitrogen, none (no) versus 3 
ewt. sulphate of ammonia per acre (n;), and depth of winter ploughing 
(7 in. versus 11 in.). Ploughing took place in late January, the nitrogen 
was applied in late April, and the seed was sown early in May. Since 
both factors occur at 2 levels or variations, the experiment is described 
as a 2 X 2 factorial experiment. The 4 treatment combinations are 
shown below, with the mean yields of sugar per acre (cwt.) underneath. 


Treatment combinations and yields of sugar (ewt. per acre) 


1 2 3 4 
(no, 7 in.) (m, 7 in.) (no, 11 in.) (my, 11 in.) 
40.9 47.8. 42.4 50.2 


The yields may be placed in the following 2 X 2 table. 


Nitrogen Response 
Depth no nN Mean ton 
7 in. 40.9 47.8 44.4 +6.9 
11 in. 42.4 50.2 46.3 +7.8 
Mean 41.6 49.0 45.3 


11 in. minus 7 in. +1.5 42.4 


The results might be summarized as follows. Considering the effect 
of nitrogen, we might report that the application of nitrogen increased 
yields by 6.9 ewt. with shallow ploughing and by 7.8 ewt. with deep 
ploughing. These figures are called the s¢mple effects of nitrogen. They 
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represent the type of information that would be wanted, for instance, 
in giving advice to a farmer who always used shallow ploughing but was 
doubtful whether to apply nitrogen. For the simple effects of depth of 
ploughing, we might report that 11 in. ploughing was superior to 7 in. 
by 1.5 ewt. in the absence of nitrogen and by 2.4 ewt. when nitrogen was 
applied. 

There is another way of looking at the results. It sometimes happens 
that the effects of the factors are independent. By this we mean that the 
response to nitrogen is the same whether ploughing is shallow or deep, 
and that the difference between the effects of deep and shallow ploughing 
is the same whether nitrogen is present or not. In this event the two sim- 
ple effects of nitrogen, 6.9 ewt. and 7.8 ewt., are estimates of the same 
quantity and differ only by experimental errors. On this supposition we 
would naturally average the two figures in order to estimate the response 
to nitrogen. The average, 7.4 cwt., is called the main effect of nitrogen. 
Tt ean be derived alternatively as the difference between the two column 
means in the table, 41.6 and 49.0. Similarly the main effect of depth of 
ploughing (11 in. minus 7 in.) is the average of 1.5 ewt. and 2.4 ewt., or 
1.9 ewt. Note that a main effect, being an average of the simple effects, 
is more precisely estimated than the latter. In this experiment the 
standard error of a main effect is 1 /x/2 times that of a simple effect. 

Consequently, if we are sure that the factors operate independently, 
the summary that was given above in terms of simple effects may be 
replaced by another that is both more concise and more accurate. This 
might read as follows. “The application of nitrogen increased the yield 
of sugar by 7.4 cwt., while 11 in. ploughing increased the yield by 1.9 
cwt. as compared with 7 in. ploughing.” It is worth repeating that when 
the factors are independent the figure 7.4 ewt. is the best estimate not 
only of the average response to nitrogen, but also of the response on plots 
ploughed to 7 in. and of that on plots ploughed to 11 in. In other words, 
the whole of the information in the experiment is contained in the main 
effects. 

The question arises: How do we know whether the factors are inde- 
pendent? Frequently the answer is suggested by knowledge of the 
processes by which the factors produce their effects. In the present case 
an agronomist might reason that deep ploughing should enable the plant 
to develop a more vigorous root system. With this the plant should be 
able to utilize more effectively any added nutrient such as nitrogen. 
Thus he might predict that the response to nitrogen would be greater 
with deep than with shallow ploughing, though he probably would not 
expect it to be much greater. In short, he would predict that the two 
factors would not be quite independent in their effects. 
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In addition to the information that may be available from such 
reasoning, a factorial experiment itself provides a test of the assumption 
of independence. For, if the depth of ploughing does affect the response 
to nitrogen, the difference between 7.8 cwt. (the response to nitrogen 
with deep ploughing) and 6.9 ewt. (the response with shallow plough- 
ing) is an estimate of this effect. The difference, 0.9 ewt., can be tested 
in the usual way by a test. If it proves significant, the assumption of 
independence is rejected by the data. The difference (sometimes divided 
by a numerical factor) is called the interaction between nitrogen and 
depth of ploughing. 

Interchanging the roles of the two factors, we may also consider 
whether the superiority of deep over shallow ploughing is affected by the 
presence of nitrogen. To measure the interaction in this case, we sub- 
tract 1.5 ewt. (superiority of deep ploughing when no nitrogen is added) 
from 2.4 ewt. (superiority when nitrogen is added). The difference is 
again 0.9 ewt. It is easy to see that this equality always holds with a 
2 X 2 experiment. Each difference is equal to the sum of the observa- 
tions in one diagonal of the 2 X 2 table, minus the corresponding sum 
for the other diagonal. Such interactions are called two-factor, or first- 
order, interactions. 


5.12 Advantages of Factorial Experimentation When Factors Are 
Independent. The advantages of factorial experimentation naturally 
depend on the purpose of the experiment. We suppose for the present 
that the purpose is to investigate the effects of each factor over some pre- 
assigned range that is covered by the levels of that factor which are used 
in the experiment. In other words the object is to obtain a broad picture 
of the effects of the factors rather than to find, say, the combination of 
the levels of the factors that give a maximum response. One procedure 
is to conduct separate experiments each of which deals only with a 
single factor. Another is to include all factors simultaneously by means 
of a factorial experiment. 

If all factors are independent in their effects, the factorial approach 
will result in a considerable saving of the time and material devoted to 
the experiments. The saving follows from two facts. First, as we have 
seen, when factors are independent all the simple effects of a factor are 
equal to its main effect, so that main effects are the only quantities needed 
to describe fully the consequences of variations in the factor. Secondly, 
in a factorial experiment, each main effect is estimated with the same 
precision as if the whole experiment had been devoted to that factor 
alone. Thus, in the preceding example, half the plots receive nitrogen 
and half do not. Consequently, the main effect of nitrogen is estimated 
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just as precisely as it would be in a simple experiment of the same size 
devoted to nitrogen alone. The same result holds for the effect of depth 
of ploughing. The two single-factor experiments would require twice the 
total number of plots in order to equal the precision obtained by the 
factorial experiment. If there were n factors, all at two levels and all 
independent, the single-factor approach would necessitate n times as 
much experimental material as a factorial arrangement of equal precision. 
The gain from factorial arrangements in this case is very substantial. 

Practical considerations may diminish this gain. The experimenter 
frequently lacks the resources to conduct a large experiment with many 
treatments, and must proceed with only one or two factors at a time. 
Further, it has been pointed out previously that, as the number of treat- 
ment combinations in an experiment is enlarged, the standard error per 
unit increases. This standard error is therefore likely to be higher for a 
large factorial experiment than for a comparable single-factor experi- 
ment, This increase in standard errors can usually be kept small by the 
device known as confounding, described in chapter 6. 


5.13 Factorial Experimentation When Factors Are Not Independent. 
We assume that the purpose is still to investigate each factor over the 
range represented by its levels. When factors are not independent, the 
simple effects of a factor vary according to the particular combination 
of the other factors with which these are produced. In this case the 
single-factor approach is likely to provide only a number of discon- 
nected pieces of information that cannot easily be put together. In 
order to conduct an experiment on a single factor A, some decision must 
be made about the levels of other factors B, C, D, say, that are to be 
used in the experiment (e.g., whether all plots should be ploughed 7 in., 
9 in., or 11 in. deep in an experiment on nitrogen). The experiment re- 
veals the effects of A for this particular combination of B, C, and D, but 
no information is provided for predicting the effects of A with any other 
combination of B, C, and D. With a factorial approach, on the other 
hand, the effects of A are examined for every combination of B, C, and 
D that is included in the experiment. Thus a great deal of information is 
accumulated both about the effects of the factors and about their inter- 
relationships. 

In this connection, Fisher (5.1) has pointed out that it is sometimes 
advisable to introduce into an experiment an extra factor that is not it- 
self of interest, in order that the experiment may form the basis for 
sounder recommendations about the other factors. In agricultural ex- 
perimentation in Britain, farmyard manure has served as a subsidiary 
factor of this kind. Any recommendations made to farmers about other 
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factors will be put to the test in some fields where the farmer has ap- 
plied manure and in others where he has not, so that it is well to in- 
vestigate the other factors in both the presence and the absence of ma- 
nure. 


5.14 Summary Comments. To summarize, the following are some in- 
stances where factorial experimentation may be suitable: 

1. In exploratory work where the object is to determine quickly the 
effects of each of a number of factors over a specified range. 

2. In investigations of the interactions among the effects of several 
factors. From their nature, interactions cannot be studied without test- 
ing some of the combinations formed from the different factors. Fre- 
quently, information is most quickly obtained by testing all combina- 
tions. 

3. In experiments designed to lead to recommendations that must 
apply over a wide range of conditions. Subsidiary factors may be 
brought into an experiment so as to test the principal factors under a 
variety of conditions similar to those that will be encountered in the 
population to which recommendations are to apply. 

On the other hand, if considerable information has accumulated, or 
if the object of the investigation is specialized, it may be more profitable 
to conduct intensive work on a single factor or on a few combinations of 
factors. For instance, some investigations are direeted towards finding 
the combination of the levels of the factors that will produce a maximum 
response. An interesting discussion of procedures for this purpose has 
been given by Friedman and Savage (5.2). They consider the case, 
common in industrial experimentation, where the effect of any specified 
treatment combination can be determined quickly. Thus the treatment 
combinations to be tested can be decided as the experiment proceeds, in 
the light of knowledge gained from combinations that have already been 
tested. They propose first a single-factor experiment to determine the 
optimum level a, of A for fixed levels b, c, and d, say, of the other factors. 
This is followed by a single-factor experiment to determine the optimum 
level by of B for fixed values a1, c, and d. Similarly, C is tested at ay, bi, 
d, and D at a, b;, cy. After the completion of the first “round,” the 
whole process is repeated until the maximum appears to have been 
reached. The authors show that the maximum will usually be reached 
more quickly than with a complete factorial, since the plan is designed 
to concentrate on combinations that are near the maximum. The plan 
would not be feasible for agricultural field experiments, where in general 
an experiment can be changed only once a year. 

Experimenters sometimes find the results of factorial experiments 
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difficult to interpret, because they appear to present a bewildering va- 
riety of treatment comparisons. It is true that the competent summary 
of a large factorial experiment demands an orderly procedure and often 
takes considerable time. If the factors are for the most part independent, 
the method of analysis by means of main effects and interactions (to be 
illustrated later) will reduce the data to manageable proportions. If 
the numerous factors interact in a puzzling manner, prolonged study of 
the results and further experimentation may be needed before the facts 
are mastered. The trouble in this case is that the phenomena are com- 
plex, not that the experimentation is faulty. 


5.2 Calculation of Main Effects and Interactions 


5.21 Notation for the 2” Series. The object of section 5.2 is to ex- 
plain how main effects and interactions are calculated, and how they are 
represented in the analysis of variance of the results. We begin with the 
2” series, where each factor occurs at only two levels. 

For this system the notation used is similar to that of Yates (5.3). 
Letters A, B, C, --- denote the factors. The letters a, b, c, -++ denote 
one of the two levels at which the corresponding factor occurs; for pur- 
poses of clarity this level will be called the second level. The first level 
is signified by absence of the corresponding letter. Thus the treatment 
combination bd, in a 24 factorial experiment, means the treatment com- 
bination which contains the first levels of factors A and C, and the second 
levels of factors B and D. The treatment combination which consists of 
the first level of all factors is denoted by the symbol (1). 

The symbol (ab) will denote the mean of all observations which receive 
the treatment combination ab. The letters A, B, and AB, when they 
refer to numbers, will represent, respectively, the main effects of A and 
B and the A by B interaction. 


5.22 The 2? Factorial Experiment. ‘As already shown in section 5.11, 
we have 

A=H@) -®+@-M 

B= 4a) +0 -@-(O) 


Yates (5.3) introduces the same multiplier 4 for the interaction, which 
he defines as 


AB = 4a) —® -@+ Ol 


These quantities and the general mean M are shown in terms of the 
means for the treatment combinations in table 5.1. 
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TABLE 5.1 Matn EFrEcTs AND INTERACTIONS EXPRESSED IN TERMS OF 
INDIVIDUAL TREATMENT MEANS: 2? FACTORIAL 


Treatment combination 
Factorial 
effect 


Divisor 


(1) (a) (b) (ab) 


+ 
A = + 


Ierliect= 
vw r 


+ 
+ 


AB a af 


The rows of the table express the factorial effects in terms of the 
original means. If the equations represented by the table are solved for 
(ab), (0), ete., in terms of M, A, ete., it will be found that the columns 
of the table enable us to express the original means in terms of the 
factorial effects. For example, from the column for (ab), 


(ab) = M+3[A+ B+ AB] 


The only point to remember is that the factor } occurs with all terms 
except M. From these results, simple effects may be calculated from 
factorial effects. Thus 


(a) — (1) = simple effect of A when B is at the first level = A — AB 
(ab) — (b) = simple effect of A when B is at the second level = A + AB 


These relations are useful when an experiment has been summarized in 
terms of the factorial effects, and it is later desired to estimate some of 
the simple effects. From the above example it may be noted that the 
quantity AB measures the error that is committed in estimating the 
simple effects of A if the two factors are erroneously assumed to be in- 
dependent. 

The functions A, B, and AB satisfy the conditions for an orthogonal 
set of functions (section 3.42). Consequently the squares of the fac- 
torial effects, when suitably multiplied, divide the treatments s.s. into 
three single components, each with 1 d.f. In practice these components 
will usually be calculated from the treatment totals [ab], [b], etc., rather 
than from means. If we define factorial effect éotals as illustrated below, 


[A] = [ab] — [b] + la] — [1] 
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then the sum of squares for A in the analysis of variance is [A??/4r, where 
ris the number of replicates. Corresponding formulae hold for the other 
factorial effects. 


5.23 The 2° Factorial Experiment. In this case there are eight treat- 
ment combinations: (1), a, , c, ab, ac, be, and abe. The simple effect. of 
A is determined for each of four combinations of the other factors: (1), 
b, c, and bc. As before, the main effect of A is defined to be the average 
of these four simple effects. 


A = (abe) — (be) + (ad) — &) + ae) —@ + @ — OI 
Similar definitions hold for B and C. 


The interaction of A with B is now measured separately at each of the 
two levels of C. 


AB (C at second level) = ${(abe) — (be) — (ae) + (c)] 
AB (C at first level) = 4{(ab) — (0) — @ + ()] 


As would be expected, the quantity AB is taken as the average of these 
two effects. Thus 

AB = 4{(abe) — (bc) — (ac) + (¢) + (ab) — (b) — (a) + ()] 
There are two other first-order interactions, AC and BC, which are de- 
fined similarly. 

In addition, we encounter a new interaction. Separate estimates were 
given above for AB at each of the two levels of C. The difference be- 
tween these two estimates measures the effect of C on the AB inter- 
action. This difference, with the conventional factor 4, is 


4{(abe) — (be) — (ae) + (¢) — (ab) + &) + (@) — ()] 


and may be called the interaction of AB with C. If the algebra is carried 
out, it will be found that the same expression measures the interaction 
of AC with B, and that of BC with A. Hence the quantity is called the 
ABC interaction. It is a three-factor, or second-order, interaction. 
Three-factor interactions are more difficult to understand than two- 
factor interactions, Fortunately, in practice three-factor interactions 
are often small relative to main effects and two-factor interactions; 
and quite frequently they can be neglected for the purposes to which the 
results are to be put. Occasionally, cases arise where they are important. 
It might happen, for instance, that factor A does not exert any influence 
unless factors B and C are present in the combination (bc). In this event 
the interaction ABC is as large as the main effect of A or the interactions 
AB and AC. The same effect may occur in less extreme cases, where the 
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combination be is particularly favorable to the response to factor A. If 
three-factor interactions are found to be substantial, a careful scrutiny 
of the simple effects is usually helpful in interpretation. 

The expressions for the factorial effects in terms of the treatment 
means are summarized in table 5.2. The rows of the table give the 


TABLE 5.2 MAIN EFFECTS AND INTERACTIONS EXPRESSED IN TERMS OF 
INDIVIDUAL TREATMENT MEANS: 2° FACTORIAL 


Treatment combination 
Factorial Divisor 
effect 
(@ @ @) (ab) (© (ac) (bc) (abe) 

M ot Ae ar = Gr ar aP an 8 
A = ote = + om = 4 
B az = ae oI = = la ay 4 
Cc = = = 38 ay + oF 4 
AB + = = Ta ar em = ce 4 
AC ar = i = a 4 
BC an F = = = = aR qe 4 
ABC = ae ae = = = Si oe 


factorial effects in terms of the treatment means, while the columns give 
the treatment means in terms of the factorial effects. For example, 


(2) =M+4[A —B—C — AB-—AC+ BC + ABC] 
(a) —() = A — AB — AC + ABC 
(abe) — (1) = A+B+C+ ABC 


As before, the factor $ appears with all terms except M in the expression 
for a treatment mean. The difference between two treatment means, of 
which two examples are given above, is easily found by noting the signs 
in the two columns in question. 

As the reader may verify, the 7 factorial effects are mutually orthog- 
onal, and each is orthogonal to M. If r is the number of replicates, each 


, 80 o 
factorial effect has the same variance, Be or The contribution of 
any effect to the sum of squares for treatments is[ ]?/8r, where [ ] 


denotes the factorial effect total; e.g., 


[A] = — [1] + [a] — [6] + [ab] — [e] + [ac] — [be] + [abe] 
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5.24 The 2” Factorial Experiment. A few formulae will be given 
which apply to any number of factors, all at two levels. With n fac- 
tors, there are n main effects, n(n — 1)/2 two-factor interactions, 
n(n — 1)(n — 2)/6 three-factor interactions, and so on. The successive 
numbers are the coefficients in the expansion of (1+ 1)", omitting the 
first coefficient, unity. 

Two equivalent methods for writing out any factorial effect in terms 
of the original treatment means are available. Each is simple to re- 
member, though the actual writing may take some time in a large ex- 
periment. They will be illustrated by finding the BCDE interaction in 
a 2° experiment, with factors A, B, C, D, E, and F. 


Rule 1. Evens versus odds. In every factorial effect, half the treatment 
combinations receive a + sign and half a — sign. Those which receive 
one sign are those which contain an even number of the letters that ap- 
pear in the factorial effect. In the BCDE interaction 4 letters, b, ¢, d, 
ande, appear. There are8 combinations that contain an even number of 
letters: (1), be, bd, be, ed, ce, de, and bdce. Each of these can be combined 
with any one of 4 combinations of the remaining letters a and f, namely, 
(1), a, f, and af. The 32 terms appear below. Since the terms contain 
abcdef, they receive by convention a + sign. 

qd) a f af 

be abe bef abef 
bd abd bdf abd 
be abe bef abef 
ed acd cdf acdf 
ce ace cef acef 
de ade def adef 
bede abcde bedef + abedef 


The 32 combinations with a — sign are those which have an odd num- 
ber of the letters b, c, d, ore. In detail, these are 


b ab bf abf 
tc ac of acf 
d ad df adf 
e ae ef aef 


bed —abed bedf abe f 
bee abce beef abcef 
bde abde bdef abdef 
cde acde edef acdef 


When calculated from the treatment means, this difference between the 
sums for the two groups of 32 combinations is divided by 2" to give the 
BCDE effect. 
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Rule 2. Algebraic. In this rule the BCDE interaction is expressed 
formally as 


BCDE = 


1 
gr-1 


@+HO-D)eE-HYE€-NYeE-HYF+H) 


Note that — signs appear with the factors that enter into the interaction 
and + signs with those that do not. If this expression is expanded al- 
gebraically, it gives the interaction as a linear function of the treatment 
means. 

As in the 2? and 2° cases, all factorial effects are orthogonal to one an- 
other and to the mean. The contribution of any effect to the sum of 
squares in the analysis of variance is[ ]?/2"r, where [ ] denotes the ef- 
fect total. 


5.25 Factors at More than Two Levels: a 4 x 2 Factorial. Various 
notations are used. For example, the 3 levels of a factor A may be de- 
noted by do, a1, a2, or by ay, a2, a3, or simply by the numbers 0, 1,2. For 
illustration, we give below the treatment totals in a 4 X 2 experiment 
on sugar cane. The treatments were 4 levels of dressing of potash, ko, 
ky, ke, and kg, in arithmetic progression, and 2 levels of phosphate, po 
and p;. There were 5 replicates in randomized blocks. 


TABLE 5.3 Toran yimups oF SUGAR CANE (TONS PER ACRE) 


ko ky ke kg Total 


Po 180 248 277 285 990 
PL 251 307 342 346 1246 


Totals | 431 555 619 631 2236 


The main effect of phosphate (P), which occurs at only 2 levels, is of 
course derived from the comparison of the 2 marginal totals, 990 and 
1246. For the main effect of potash (K) there are 4 marginal totals 
which may be compared. It will be recalled (section 3.42) that 3 in- 
dependent comparisons may be made amongst 4 totals, and that an in- 
finite number of such sets of 3 may be chosen. Thus the main effect of 
K comprises 3 independent comparisons. 

With the 2” system, we were able to define every main effect as a spe- 
cific linear combination of the treatment means. In the present case we 
could select a particular set of 3 independent comparisons, each of which 
would be a specific linear combination of the treatment means. These 
could be defined as the “components” of the main effect of K. However, 
the particular set that is most useful for the interpretation of the results 
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will change from experiment to experiment, so that a formal definition 
of this type would be of limited utility. The experimenter should use 
whichever set appears most relevant. 

Now consider the interaction between P and K. From the differences 
between the two rows in table 5.3, the effect of P is estimated separately 
at each level of K, the estimates being 


ko It ke kg 
(pi — Po) 71 59 65 61 


Any comparison among these 4 figures is a measure of the effect of K on 
the response to P, and therefore is a component of the interaction be- 
tween P and K. Consequently the interaction between P and K con- 
sists of 3 independent comparisons, and is said to have 3 df. As with 
the main effect of K, the particular set of 3 components that will be of 
interest varies with the type of experiment. 

We may also wish to consider the interaction as the effect of P on the 
response to K. As we have seen, the response to K has 3 components. 
Since there are 4 increasing levels of potash dressing, we might choose as 
components the linear, quadratic, and cubic components of the response 
curve. Apart from a divisor, the lmear component is 


—3(180) — 1(248) + 1277) + 3(285) = 344 
at the lower level of P and 

—3(251) — 1(307) + 1842) + 3(846) = 320 
at the higher level of P. The difference between these 2 figures, —24, 
estimates the effect of P on the linear response to K, and is a part of the 


interaction between K and P. The other two components are the ef- 
fects of P on the quadratic and cubic responses to K. 

Ina 2” system, the interaction between P and K is identical with the 
interaction between K and P. In the more general case the correspond- 
ing result is that any component of the interaction between K and P can 
be derived from the components of the interaction between P and K, 
For example, the effect of P on the linear response to K can be written 


—24 = —3(71) — 1(59) + 1(65) + 3(61) 
so that it is a linear function of the responses to P at the 4 levels of K. 
Tn this more general sense, the two interactions are still equivalent. 
We will consider the analysis of variance of the 8 treatment totals in 
some detail. With 5 replicates, the total s.s., on a single-plot basis, is 
fs e 24 ...+ (346)? (2236)? 
(180)? + (251)* ++ “ + +--+ G46)" ¢ a Ne 4165.2 
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The 7 d.f. subdivide into 1 for the main effect of P, 3 for that of K, and 
38 for the PK interaction. In practice the first two terms are calculated 
directly and the interaction obtained by subtraction. The computations 
are 
(1246 — 990)? 
P:———————— 


= 1638.4 
40 
431)? + (555)? + (619)? + (631)? (2236)? 
x, 31? + (655)* + (619)* + (631)? 2236)? _ 
10 0 
TABLE 5.4 PRWLIMINARY ANALYSIS OF VARIANCE OF DATA IN TABLE 5.3 
df. 8.8. m.s. 
NS 1 1638.4 1638.4 
K 3 2518.4 839.5 
PK 3 8.4 2.8 
Total 7 4165.2 


As is common in agricultural experiments, the interaction m.s. is small 
compared with those for the main effects (in fact it is below the mean 
square for error). 

As an exercise we will divide the K main effect and its interaction with 
P into linear, quadratic, and cubie components. The first step is to cal- 
culate the totals for these effects separately at each level of P. The re- 
sults are shown below, the multipliers for the 4 levels of K being shown in 
parentheses. These multipliers are obtained from a table of orthogonal 
polynomials (5.9). 


Ki Ka Ke 
(—8, -1, +1, +8) (41, —1, —1, +1) (—1, +3, —3, +1) 
Po +344 — 60 +18 
pr +320 — 52 10 
Sum (pi + po) +664 —112 +8 
Diff. (p1 — po) — 24 np kel —28 


From the swm line, we obtain the 3 components of the K main effect, 
Ky = xh (664)? = 2204.5, Ky = qp(112)? = 313.6 
K. = 345(8)? = 03 


Similarly, from the difference line, the components of the PK interaction ~ 


are 
PK, = xop(24)” = 2.9, PKy = y(8)? = 1.6 


a 
PK, = 3$p(28)? = 3.9 
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The divisors are found from the usual rule for linear functions (section 
3.42). The three K components add to 2518.4, the total s.s. for K, while 
the PK components add to 8.4, the sum of squares for PK. The detailed 
analysis of variance is shown in table 5.5. 


TABLE 5.5 Mor& DETAILED ANALYSIS OF VARIANCE OF THE DATA 
IN TABLE 5.3 


df. 8.8. Or M.8. 
FB uk 1638.4 
Ki 1 2204.5 
Ky 1 313.6 
Ke 1 0.3 
PRi 1 2.9 
PK, 1 1.6 
PK. 1 3.9 


Since every component has 1 d.f., the mean squares are the same as 
the sums of squares. The error m.s. is 16.35, with 28 d.f. The quadratic 
component of K is significant, indicating a falling off in the response at: 
the higher levels of application. Neither the cubic component nor any 
of the interactions approaches significance. 

5.26 A 3? Factorial Experiment. As a second example, the treatment 

totals are shown in table 5.6 for an experiment with 3 levels of nitrogen 

fertilizer and 3 of phosphate fertilizer. The data are the numbers of 

lettuce plants that emerged from the ground and are totals over 12 plots 
TABLE 5.6 NuMBERS OF LETTUCE PLANTS EMERGING te 


no m ng Totals 


m | 449 (4a) 413 (C8) 326 (By) | 1188 
pm | 409 (BB) 358 (Ay) 291 (Ca) | 1058 
mm | 341 (Cy) 278 (Ba) 312 (AB) | 931 


Totals| 1199 1049 929 8177 
* The use of the latin and greek letters will be explained later. 


each. Both nitrogen and phosphate appear to have had a deleterious 
effect on emergence (the subscript 2 denotes the largest application). 
The main effects of N and P both comprize two independent com- 
parisons, and thus have 2 df. each. Since the amounts of N and P were 
in arithmetic progression, it would probably again be appropriate, as in 
the previous example, to choose the linear and quadratic components of 
the regression on amount of dressing as the individual components of the 


main effects. 
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The N; component can be estimated separately at each level of P. 
This means that there are 2 d.f. which measure the effect of P on the 
linear response to N. These are a part of the NP interaction. Another 
2 df. are supplied by the effect of P on the N, component, so that the 
NP interaction contains 4 df. 

From inspection of the margins of table 5.6, it is evident that the main 
effects of both N and P are approximately linear. Consequently, the 
most interesting single degree of freedom from the interactions is likely 
to be the interaction of N; with P;. In many agricultural experiments 
this type of interaction is the only one that approaches significance. It 
is worth while calculating this interaction separately in the analysis of 
variance. The other three components, N;P4, NgPi, and N,P,, will also 
be obtained. The initial computations appear in table 5.7. 


TABLE 5.7 CatcuLaTIion OF LINEAR AND QUADRATIC EFFECTS FOR THE 
ANALYSIS OF VARIANCE 


Ni Ny Pi Py 
(=1, 0, 1) (1, —2, 1) (=1, 0, 1) (1, —2, 1) 
Po 123 eal no —108 —28 
Pi -118 — 16 nm —185 —25 
pe — 29 +97 ne S14 +56 
Sum —270(Ni) + 30(Nq) Sum —257(Pi) + 3(P,) 
Pi + 94(NiP) —- +148(N{Pi) NM + 94(PiNi) +-84(P_Ni) 
Py + S4(MP,) + 78(NgP,) | Ng +US(PiN,)  +78(P,N' a) 
The left side of the table shows the N; and N, effects for each level of 
P, while the right side shows the P; and P, effects for each level of N. 


For instance, from table 5.6, 
Nipo = 326 — 449 = —123; Pang = 449 — 2(409) + 341 = —28 


The column sums give the individual components of the N and P main 
effects. Now consider the difference between the third and the first row. 
For the N; column, this difference (+94) gives the linear effect, of P on 
N:i.e., the NP; interaction. From the P; column, the difference gives 
the linear effect of N on P;, or the P,N; interaction, which is exactly the 
same as the NP; interaction. The other two columns provide the N,P1 
and the NP, effects. 

Finally, the sum of the first and third rows, minus twice the second 
row, leads to the components of interaction that contain a quadratic 
term. It will be seen that all four components can be obtained from 
either the left or the right half of the table, so that in practice only one 
half is required. By computing both halves we verify the symmetry of 
the components with respect to V and P. 
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Tne squares of these quantities, with appropriate divisors, will give an 
analysis of variance of the 8 d.f. amongst the treatment totals into 8 
single components. Since an individual entry in table 5.6 is the total 
of 12 plots, the divisors may be verified to be as shown below. 


NrorP; NqorPg NiPi NiPy NePi NiPa 


Divisor 72 216 48 144 144 432 


For instance, the divisor for NP, may be worked out as follows. The 
three Nj figures in table 5.7 (ie., —51, —16, +97) each have divisor 
12(12 + 2? + 12), or 72. Since the NP, total is a linear function of 
these three figures, with coefficients 1, —2, and 1, this total has divisor 
72(12 + 22 + 12), or 432. The analysis of variance is shown in table 5.8. 


TABLE 5.8 SUBDIVISION OF THE TREATMENT 8.8. 


df. 8.8. OF M.8. 
M 1 1012.50 
Ng I 4.17 
Py 1 917.35 
Py 1 0.04 
NiPi 1 184.08 
MiPy 1 49.00 
NqPi 1 152.11 
NyPa 1 14.08 


The error m.s. in this experiment is about 59. The linear effects of both 
fertilizers are significant, with no indication of curvature. The N;P; ef- 
fect is significant at the 10% level, but not at the 5% level. 

In table 5.6, p. 135, latin and greek letters were superimposed so as to 
form a3 X 3 graeco-latin square. This square leads to another method 
of calculating the sum of squares for the interactions (4 df.). Although 
the method is not likely to be of use for purposes of interpretation, it has 
formed the basis of some ingenious devices in the construction of de- 
signs. _In the square, the column totals represent the main effects of 
and the row totals those of P. Since the latin letter totals are orthogonal 
to rows and columns, it ‘seems reasonable to suppose that they must 
represent two of the 4 components of the NP interaction. Similarly, 
the greek letter totals represent the remaining 2 components. These 
totals are shown below. 


A B Ci Total a B y Total 
1119 1013 1045 3177 1018 1134 1025 83177 


. 
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Each figure is now a total of 36 plots. The sum of squares of deviations 
of the latin letter totals, divided by 36, is 164.22, and the corresponding 
figure for the greek letters is 235.06. These add to 399.28, which is the 
same as the total s.s. for the interactions in table 5.8, apart from round- 
ing differences. 


5.27 General Method of Analysis. Suppose that there are three fac- 
tors, A, B, C, which occur at a, 8, and y levels respectively. The main 
effects have (a — 1), (8 — 1), and (y — 1) components or degrees of 
freedom respectively. Each component of the main effect of A can be esti- 
mated separately at each of the 6 levels of B. Thus each component of A 
contributes (8 — 1) degrees of freedom to the AB interaction. Thismeans 
that the AB interaction contains a total of (a — 1)(8 — 1) components 
or degrees of freedom. Similarly the AC interaction has (a — 1)(y — 1) 
degrees of freedom, and the BC interaction has (8 — 1)(y — 1) degrees 
of freedom. 

To compute the sums of squares for the main effects and first-order 
interactions, we form two-way tables for each pair of factors. Consider 
the A by B two-way table. The total s.s. among cells has (a8 — 1) de- 
grees of freedom. From the marginal totals in the table we compute 
the sum of squares for the main effect of A, with (a — 1) degrees of free- 
dom, and that for the main effect of B, with (6 — 1) degrees of free- 
dom. By subtraction, the sum of squares for the AB interaction, with 
(a — 1)(8 — 1) degrees of freedom, is obtained. 

There remains the three-factor, or ABC, interaction. Now each of 
the (a — 1)(8 — 1) components in the AB interaction is estimated sep- 
arately at each level of C. It will therefore contribute (vy — 1) degrees 
of freedom to the ABC interaction, so that the latter contains in all 
(a — 1)(8 — 1)(y — 1) degrees of freedom. The sum of squares for ABC 
is also obtained most easily by subtraction. From the total s.s. amongst 
treatments, with (a8y — 1) degrees of freedom, subtract the sums of 
squares for A, B, C, AB, AC, and BC. The remainder will be the sum of 
squares for ABC. 

It is hoped that the reader will find no difficulty in extending these 
methods to the case where there are more than three factors. In gen- 
eral, the sums of squares for main effects are calculated directly, and those 
for interactions are calculated by subtraction. To compute an ABCD 
interaction, for instance, we require a four-way table for the four factors 
represented. From the sum of squares for this table we subtract the sum 
of squares for all main effects and two- and three-factor interactions 
among the factors in question. Calculations must be checked by re- 
computation. Of course, if the experimenter subdivides any interaction 


5.28 INTERPRETATION OF THE ANALYSIS: FIRST EXAMPLE = 139 


into single components as in the previous section, a check is provided by 
this process. 

In the general case it remains true that any component of a factorial 
effect is orthogonal to any component of any other factorial effect. Thus 
any component of the BC interaction is orthogonal to any component of 
the A main effect, or of the BCD interaction, etc. Two components of 
the same factorial effect may or may not be independent. For instance, 
if A occurs at three levels, the comparison (a2 — do) is independent of the 
comparison (a2 — 2a; + ao), but is not independent of (a; — ao). 

The following selected references contain a discussion of factorial ex- 
periments with some worked examples. 


5.8 Yarus, F. The design and analysis of factorial experiments. Imp. Bur, Soil 
Sci. Tech. Comm. 35, 1937. This gives the most comprehensive account that 
is available, with numerous worked examples. 

5.4 Yares, F. Complex experiments. Jour. Roy. Stat. Soc. Suppl. 2, 181-247, 
1935. An earlier reference, with examples of 2?, 23, 4 X 3, and 3° factorials. 

5.5 Trererr, L. H.C. The methods of statistics. Williams and Norgate, London, 
2nd ed., 1937. The use of factorial designs in industrial experimentation is in- 
dicated. A worked example is given of the breakdown of the treatment s.s. 
ina 5 X 4X3 X 2 factorial in single replication. 

5.6 Lanpquist, E. F. Statistical analysis in educational research. Houghton. 
Mifflin, Boston, 1940. The application of factorial designs in experiments on 
methods of teaching is described, with a worked analysis of variance for a 


4 X 8 design completely randomized. 
5.7 Buss, C. I. Factorial design and covariance in the biological assay of vitamin 
D. Jour. Amer. Stat. Assoc. 35, 498-506, 1940. Worked example of a 3 X 2 


design. 


5.28 Interpretation of the Analysis: First Example. The separation 
of the treatment comparisons into main effects and interactions is a con- 
venient and powerful method of analysis in cases where interactions are 
small relative to main effects. When interactions are large, this analysis 
must be supplemented by a detailed examination of the nature of the 
interactions. It may, in fact, be found that an analysis into main effects 
and interactions is not suited to the data at hand. There is sometimes a 
tendency to apply the factorial method of analysis mechanically without: 
considering whether it is suitable or not, and also a tendency to rely too 
much on the initial analysis of variance alone when writing a summary of 
the results. Below two examples are presented where the initial analysis 
of variance is not very informative, and where the results can be sum- 
marized better in terms of simple rather than factorial effects. 

The first experiment was conducted by the Wailuku Sugar Company. 
Three varieties of sugar cane were compared, in combination with three 
levels of nitrogen (150, 210, and 270 lb. N per acre respectively). The 
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crop was the second harvesting, or the first ratoon crop. In table 5.9 
only the relevant part of the analysis of variance is shown: i.e., the sum 
of squares for the main effects of V and N and for the VN interaction, 
plus the errors.s. The data are in tons of cane per acre. The conclusions 


TABLE 5.9 ANALYSIS OF VARIANCE OF A 3 X 3 SUGAR CANE EXPERIMENT 


df. ‘8.8. m.s. 
V 2 319.38 159.69 * 
N 2 56.54 28.27 
VN 4 559.79 139.95 * 
Error 24 1053.84 43.91 


from the analysis of variance are that the main effects of V and the VN 
interaction are both significant, but there is no sign of a main effect due 
to N. This statement tells little about the results of the experiment. 

The treatment totals (over 4 replications) are shown below with their 
standard errors. Since the standard error per plot is V 43.91 = 6.626, 
the standard error for a treatment total is 13.3 as shown. 


TABLE 5.10 Treatment TOTALS (TONS) IN A 3 X 3 SUGAR CANE EXPERIMENT 


(+13.3) 
no mM ne Total 8.e. 
1 266.1 275.9 303.8 845.8 
9 245.8 250.2 281.7 MEW 23.0 
v3 274.4 258.1 231.6 764.1 
Total 786.3 784.2 817.1 2387.6 
8.e. 23.0 


Instead of having no effect, nitrogen has apparently given a steady in- 
crease in yields with the first two varieties, but a steady decrease with the 
third variety. Further, the significant main effects of varieties apply 
only to the average varietal yields over all 3 dressings of N, and not to 
yields with a particular rate of dressing. On the average 2; gives a sub- 
stantially higher yield than v3, but at the lowest level of N, vg is slightly 
above 0. 

The subsequent analysis may be made either by means of t-tests ap- 
plied to table 5.10 or by means of a further subdivision of the analysis of 
variance. It is of interest to examine the response to N for each variety 
separately. To test the linear responses by means of a t-test, we require 
the standard error for (nz — mo) as computed for each variety. This 
standard error is 13.3 X 1.414, or 18.8, and since the 5% t-value for 24 
df. is 2.064, the quantities (nz — no) must attain the value 18.8 X 2.064, 
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or 38.8, in order to be significant at the 5% level. The actual values are 
(nz — No)v%1 = +37.7; (nz — no)ve = +35.9; 
(ng — no)v3 = —42.8 y 


Thus neither of the increases with v; and vp quite reaches the significance 
level, though both are close to it. The decrease with v3 is significant. 

The test of linearity of the response curves is made by means of the 
quantity (ng + mp — 2m), calculated for each variety. The standard 
error of this quantity is 13.3 X +/6, or 32.6. The reader may verify 
from table 5.10 that none of the curvature terms even exceeds its stand- 
ard error, so that there is no occasion to reject the hypothesis that the 
responses are linear. 

It is not quite so clear what tests are appropriate for appraising the 
varietal differences. Since, however, the difference between v; and v is 
very consistent at all levels of N, a t-test of the total difference is sug- 
gested. This difference is 68.1, and since the value required for signifi- 
cance is 23.0 X 1.414 X 2.064, or 67.1, the superiority of v1 over v2 is 
just significant. Further, it is evident on inspection that vg does not dif- 
fer significantly from the other varieties at either of the two lower levels 
of NV. At the highest level, v3 is significantly below both 2 and 0. 

When interactions are large, much care is required in the preparation 
of a statement that summarizes the results, and it is not easy to reach a 
form that is free from criticism. The following is a suggestion for this 
example. 

“The increase in yield of cane to the highest dressing of V (270 Ib. per 
acre) over the lowest dressing (150 lb. per acre) was 9.4 tons per acre with 
», and 9.0 tons per acre with v2. Both increases just failed to be signifi- 
cant at the 5% level. With v3, on the other hand, the highest dressing 
of N decreased the yield significantly by 10.7 tons per acre as compared 
with the lowest dressing. For all three varieties the effects of N appeared 
to be proportional to the amount applied, within the range investigated 
in this experiment. 

“Variety 1 gave a higher yield than variety 2 for all levels of N, the 
average difference, 5.7 tons, being just significant at the 5% level. The 
yields for variety 3 did not differ significantly from those of the other 
varieties at the two lower levels of N. At the highest level of NV, the yield 
for variety 3 was significantly lower than that for either of the other va- 


rieties.” 


5.29 Interpretation of the Analysis: Second Example. This example 
is more complex, mainly because the factorial (a 3 X 3) is of an unusual 
type. The data come from a long-term experiment on meadow hay, 
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conducted at Lady Manner’s School, Bakewell, England, with the co- 
operation of Rothamsted Experimental Station. The yields are for the 
1937 season. The two factors are shown schematically below. 


First factor (3 levels) Second factor (3 levels) 
No manure ect No manure eas 
Mixed artificialay “PPHed m Mixed ee gue ean 
1936 | 1937 
8 tons compost 8 tons compost 


The nature of the experiment may become clear from a discussion of some 
of the individual treatment combinations. In any replicate there are 
nine plots, of which three received mixed artificial fertilizers in 1937. 
In the previous year, 1936, one of these three plots received no manure, 
one received mixed artificials, and one received compost. Thus these 
three plots enable us to compare the residual effects of the 1936 appli- 
cations of artificials and compost, on plots which received artificials in 
the current year, 1937. Similarly, from other treatment combinations, 
we can compare the residual effects of artificials and compost on plots 
which were unmanured in the current year and on plots which received 
compost in the current year. 

Further, in any replicate there are three plots which were unmanured 
in 1936. Of these, one had no manure, one had artificials, and one had 
compost in 1937. Consequently we may also assess the direct effects of 
artificials and compost applied in 1937 on plots which were unmanured 
in the previous year, and likewise on plots which received artificials or 
compost in the previous year. Both the artificials and compost contain 
the three common plant nutrients, nitrogen, phosphorus, and potash. 

The system of treatments is an ingenious one, designed to measure 
direct and residual effects at the same time. The experiment was started 
in 1932, but for simplicity we will ignore any effects of treatments applied 
prior to 1936. 

The treatment means and the analysis of variance are shown in tables 
5.11 and 5.12. The analysis of variance was calculated from the treat- 
ment means, rather than from single plots, and the error m.s. has been 
adjusted so as to apply to a treatment mean. The direct effects are 
highly significant, the interactions are significant at the 5% level, but 
residual effects are not significant. In considering the nature of the ef- 
fects leading to these results, it is again convenient to think in terms of 
simple effects rather than of main effects and interactions. It is worth 
noting that the standard error of a single entry in the two-way table is 


V 6.656, or 2.58, while that for the difference between two entries is 
3.65. 
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With regard to residual effects, table 5.11 shows that compost pro- 
duced a large and highly significant increase, 13.4 ewt. + 3.65, on the 
plots that received no manure in the current year. On plots that re- 
ceived manure in 1937, either artificials or compost, there is no suggestion 
of any residual effect of compost. None of the residual effects of arti- 
ficials approaches the level required for significance. These results are 


TABLE 5.11 TREATMENT MEAN YIELDS (HAY, CWT. PER ACRE) (2.58) 


Residual effects of 1936 treatments 


Direct effects of — 

1937 treatments None Artificials | Compost Mean 
None 53.6 56.8 67.0 59.1 
Artificials 80.8 82.3 80.5 81.2 
Compost 74.3 69.1 70.0 71.1 
Mean 69.6 69.4 72.5 70.5 


TABLE 5.12 ANALYSIS OF VARIANCE OF TREATMENT MEANS 


df. 8.8. m.8. 
Direct effects 2 732.28 366.14 ** 
Residual effects 2 18.24 9.12 
Interaction ~ 4 97.01 24.25 * 
Error 24 159.74 6.656 


in accord with general fertilizer experience, since a compost is more 
likely to give residual effects than an inorganic fertilizer, and since re- 
sidual effects would be expected to show up most clearly on plots which 
have no manures during the current year. 

So far as direct effects are concerned, artificials were superior to com- 
post whatever the residual treatment. 


1936 manuring 1937 artificials — 1937 compost (ewt.) 
No manure +6.5 
Artificials +13.2 
Compost +10.5 


The differences among these three figures are within the limits of ex- 
perimental error. To test this point, we find the sum of squares of 
deviations of these differences, which comes to 22.73. This is divided by 
2 to make it comparable with the analysis of variance, giving 11.36, 
which represents two of the 4 d.f. for interactions. The mean square, 
5.68, is slightly below the error m.s. Thus the superiority of artificials 
appears to be consistent. 
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The increases for artificials and compost over no manure are much re- 
duced on plots that received compost in the previous year. 


1936 manuring (1987 artificials — 1937 none) (1937 compost — 1937 none) 


Compost +13.5 +3.0 
Artificials and none +26.4 +16.5 
(averaged) 


The striking reduction on plots with 1936 compost is simply a reflection 
of the nature of the residual effect of compost as previously examined. 

The conclusions from the experiment might be summarized as follows. 

“Artificials applied in 1937 increased yields by 13.5 ewt. on plots that 
received compost in the previous year, and by 26.4 ewt. on plots that did 
not receive compost in the previous year. The corresponding increases 
due to 1937 compost were 3.0 ewt. and 16.5 ewt., respectively. The su- 
periority of 1937 artificials over 1937 compost, which averaged 10.1 
ewt., appeared to be independent of the type of manuring during the 
previous year. , 

“As regards residual effects, compost applied in 1936 increased yields 
by 18.4 ewt. on plots that were unmanured in 1937, but gave no apparent 
increase on plots manured in 1937. There were no significant residual 
effects of 1936 artificials.” 

It seems evident that the significant interaction m.s. in the analysis 
of variance must arise mainly from the fact that compost had a residual 
effect only when no manure was applied currently. As an exercise it may 
be instructive to isolate the part of the interaction s.s. that is due to this 
effect. Since artificials appeared to have no residual effect, we will com- 
bine the 1936 artificials with the 1936 unmanured plots. Consequently, 
the residual effects of compost are obtained from the comparison 


2(1936 compost) — (1936 artificials) — (1936 none) 


This quantity may be calculated separately for each of the 1937 treat- 
ments. Its values are 


(1937 none) = +23.6; (1937 artificials) = —2.1; 
(1937 compost) = —3.4 


The contrast between the residual effect of compost on plots without 
1937 manures and that on plots with 1937 manures may be estimated 
from the comparison 


2(-+23.6) — (—2.1) — (-8.4) = +527 
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The square of this quantity, with a suitable divisor, is a single com- 
ponent of the interaction s.s. Since the divisor for the quantities 23.6, 
etc., is 6, the divisor needed is 6 X 6, or 36. Hence the sum of squares 
for this component is (52.7)?/36, or 77.15. The remaining 3 df. for in- 
teractions have a sum of squares equal to 19.86. The mean square, 
6.62, is no larger than the error m.s. This verifies the suggestion that 
the significance of the interaction m.s. can be attributed to the type of 
residual effect of compost. 

Some readers may have difficulty in satisfying themselves that the 
component isolated above really is a part of the interaction s.s. It may 
be helpful to express the basic quantity, 52.7, as a linear function of the 
original mean yields in table 5.11. The multipliers of the means will be 
found to be as shown below. 


1937 treatments 1936 treatments 
None Artificials © Compost 


None —2 —2 4 
Artificials 1 1 —2 
Compost aN 1 —2 


The sums of the coefficients are zero over every row and column so that 
the expression is orthogonal to both sets of main effects. Hence it must 
be a part of the interaction. The coefficients also enable us to verify the 
divisor, 36, which is equal to the sum of the squares of the coefficients. 

Since the interactions can be attributed to the behavior of a single 
group of plots (those with compost in 1936 and no manuring in 1937), it 
might be suspected that this treatment had been allotted by chance a 
favorable set of plots. The residual increase, 13.4 ewt., does seem rather 
large in relation to the direct effect of compost. An examination of 
previous results does not lend much weight to this suspicion. Over the 
4 preceding years, the average residual response to compost, with no 
current manure, was 11.1 ewt. In 2 years, 1935 and 1933, the plots in- 
volved in this comparison were the same as those in 1937. For these 
years the average residual effect was 6.2 ewt. In the other 2 years, when 
a different set of compost plots is involved, the average residual effect 
was 16.0 cwt. 

The two preceding examples are intended to illustrate the fact that 
the most informative subdivision of the treatment comparisons depends 
on the type of experiment. An analysis copied from a model that ap- 
pears similar in form may be inappropriate or even meaningless. The 
experimenter should first decide which comparisons are necessary for the 
interpretation of the results. The subsequent analysis, either by t-tests 


146 FACTORIAL EXPERIMENTS 5.29 


or by subdivision of the analysis of variance, should be directed towards 
these comparisons. 


5.3 Designs for Factorial Experiments 


5.31 Factorials in Complete Block Designs. Most types of experi- 
mental plan are suitable for factorial experiments. In particular, if the 
total number of treatment combinations is not large, the designs de- 
scribed in chapter 4 are frequently used. The relative advantages of 
complete randomization, randomized blocks, and latin squares are the 
same with factorial as with non-factorial sets of treatments. For il- 
lustration, we show arrangements for a 4 X 2 factorial (i) in 8 random- 
ized blocks and (ii) in an 8 X 8 latin square. In the former case, only 
the first replicate is given. 
TABLE 5.13 FacrortaL EXPERIMENT ARRANGED IN (i) RANDOMIZED BLOCKS 
AND (ii) A LATIN SQUARE 
8 replications 8 treatment combinations 
A at 4 levels (1,2,3,4) 
B at 2 levels (1,2) 


i. In randomized blocks Analysis of variance 
Rep. I Replications 7 
12 Treatment combinations ud 
A 3 
> B 1 
z AB 3 
32 Error 49 
42 Total 63 
11 
22 
41 
ii. In a latin square Analysis of variance 
42 11 (22) 12) 31) 41 32) 21 Rows 7 
PU AB BRATS A ae re combinations : 
12 41 42 11 382 21 22 81 By 3 
81 22 21 42 11 32 41 12 B 1 
382 12 11 31 21 22 42 41 AB 3 
er yer ae Si ar ah Error 42 
11 21 12 32 41 42 31 22 Total 63 


22 31 41 21 42 11 12 32 


In the next section, a worked example of the analysis of a large fac- 
torial experiment in randomized blocks is presented. 
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TABLE 5,14 WEIGHTS OF DENERVATED (y) AND CORRESPONDING NORMAL (x) 
MUSCLE (unit = 0.01 gram) 


Rep. I Number of treatment periods daily 
Length of One (a) Three (as) Six (ae) 
treatment Type of c 1 
(minutes) current y x y = y © 

1(b1) Galvanic 72 152 74 131 69 131 
Faradic 61 130 61 129 65 126 

60 cycle 62 141 65 112 70 11 

25 cycle 85 147 76 125 61 130 

2 (bx) Galvanic 67 136 52 110 62 122 
Faradic 60 ahbl 55 180 59 122 

60 cycle 64 126 65 190 64 98 

25 cycle 67 123 72 17 60 92 

3 (bs) Galvanic 57 120 66 132 72 129 
Faradie 72 165 43, 95 43 97 

60 cycle 63 112 66 130 72 180 

25 cycle 56 125 75 130 92 162 

5 (bs) Galvanic 57 121 56 160 78 135 
Faradice 60 87 63 115 58 118 

60 cycle 61 93 79 126 68 160 

25 cycle 73 108 86 140 71 120 

Rep. II 

1 (6) Galvanic 46 97 74 131 58 81 
Faradice 60 126 64 124 52 102 

60 cycle 71 129 64 117 71 108 
25 cycle 53 108 65 108 66 108 

| 2 (be) Galvanic 44 83 58 7 54 97 
| Faradic 57 104 55 112 51 100 
60 cycle 62 114 61 100 79 115 

25 cycle 60 105 78 112 82 102 

3 (bs) Galvanic 53 101 50 103 61 115 
Faradice 56 120 57 110 56 105 

60 cycle 56 101 56 109 71 105 

25 cycle 56 97 58 87 69 107 

5 (bs) Galvanic 46 107 55 108 64 115 
Faradic 56 109 55 104 57 103 

60 cycle 64 114 66 101 62 99 


25 cycle 59 102 58 98 88 135 
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5.32 Numerical Example: a 44X83 Factorial in Randomized 
Blocks. A number of experiments have indicated that electrical stimu- 
lation may be helpful in preventing the wasting away of muscles that are 
denervated. A factorial experiment on rats was conducted by Solandt, 
DeLury, and Hunter (5.8) in order to learn something about the best 
type of current and the most effective method of treatment. The fac- 
tors and their levels are shown below. 


A B Cc 
Number of treatment Length of treatment Type of current 
periods daily (minutes) 
1 1 Galvanic 
3 2 Faradic 
6 3 60 cycle alternating 
5 25 cycle alternating 


Treatments were started on the third day after denervation and con- 
tinued for 11 consecutive days. There are 48 different combinations of 
methods of treatment, each of which was applied to a different rat. Two 
replications were conducted, using 96 rats in all. 

The muscles denervated were the gastrocnemius-soleus group on one 
side of the animal, denervation being accomplished by the removal of a 
small part of the sciatic nerve. The measure used for judging the ef- 
fects of the treatments was the weight of the denervated muscle at the 
end of the experiment. Since this depends on the size of the animal, the 
weight of the corresponding muscle on the other side of the body was in- 
cluded as a covariance variate. The data are shown in table 5.14. 

For a covariance analysis we require analyses of variance for both the 
denervated muscle (y) and the normal muscle (z), and the analysis of the 
product (yx). From the discussion in section 5.27 and preceding sec- 
tions, the reader should have little difficulty in completing the analysis. 
The replicate totals, treatment-combination totals, and the grand total 
are first computed. From the treatment totals, the three two-way tables 
are constructed, as given in table 5.15. Since each set of main effect 
totals is obtained twice, this part is self-checking (though only one set of 
main effect totals need be recorded). To illustrate the computations 
made on the A X B table, we have for the analysis of y 


Total s.s. for A X B table 


= A{(510)? + (643)? +--++ (646)?] — yl[(6069)2] = 1039 
gig[(1936)° + (2028)? + (2105)?] — g4[(6069)?] = 447 


= giz[(1565)? + (1488)? + (1476)? + (1540)?] — g4s[(6069)7] = 223 
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By subtraction, the AB sum of squares is found to be 369. For the sum 
of products, we replace each y? by the corresponding yx product. Thus 
Total sp. = $[(610)(1030) + (643) (977) +--+ (646) 85)] 

— giy(6069) (11,307) = 1151 


Finally, the ABC sum of squares is found by calculating the total s.s. 
among all 48 treatment combinations, and subtracting the sum of squares 
for A, B, C, AB, AC, and BC. 


TABLE 5.15 Two-way TABLES IN A 4 X 4 X 3 FACTORIAL 


y x 
ay ag a6 Total a ag ag Total 
B by 510. 0-543 512 | 1,565 1,030 977 897 2,904 
Length of be 481 496 511 | 1,488 902 1,038 848} 2,788 
treatment b3 469 471 536 | 1,476 941 896 1,000] 2,837 
bs 476 518 546 | 1,540 841 952 985] 2,778 
C G 442 485 518 | 1,445 917 992 925 | 2,884 
Type of F 482 453 441 | 1,376 952 969 873 | 2,794 
current 60 503 522 ©5557 | 1,582 930 985 976] 2,891 
25 509 568 589 | 1,666 915 917 956| 2,788 
A totals 1,936 2,028 2,105 | 6,069 3,714 3,863 3,730 | 11,307 
y © 


Fle, Oe pay | [CMe tele Ui bs 


Cc G|393 3387 359 356 | 723 665 700 746 

Type of F | 363 387 327 349 | 737 729 692 636 
current 60 | 403 395 384 400 | 718 743° «737 += 698 
25| 406 419 406 435 | 726 651 708 703 


The analyses of y*, yx, and 2 are given in table 5.16. The regres- 
sion coefficient of weight of denervated on weight of normal muscle is 
3977/16,013, or 0.248361. The next step is to form the analysis of 
(y — bx). This is done by adding to any y? value (—2b) times the cor- 
responding yx value, plus (b?) times the corresponding x? value. The 
mean squares for (y — bz) are the quantities used to test the significance 
of treatment effects on the weights of denervated muscles, adjusted for 
their regression on the weights of the normal muscles. As shown in 
section 3.86, the treatment m.s. is slightly inflated because of the sam- 
pling error of the regression coefficient. The inflation is sufficiently 
small, however, that only effects which appear to be on the borderline of 
significance need be tested by the exact method (section 3.86), which is 
more laborious to calculate. 
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In the analysis of variance (table 5.16) only two factorial effects are 
significant: those for the main effects of C (type of current used) and for 
the main effects of A (number of treatment periods daily). The AC in- 
teraction m.s. is somewhat higher than the error m.s., but does not ap- 
proach the 5% significance level. These results suggest that in preparing 


TABLE 5.16 ANALYSES OF VARIANCE 


y yx 2 (y — bx)® 
df. S.S. 8.p. S.S. 8.5. m8. 
Replications 1 605 2,503 10,354 0 Aan 
A (number of treatments) 2 447 64 418 441 220.5 * 
B (length of period) 3 223 137 415 181 60.4 
C (type of current) 3 2,145 104 281 2,111 703.7 ** 
AB 6 369 950 5,202 218 36.3 
AC 6 645 476 1,014 471 78.5 
BC 9 299 11 2,015 418 46.4 
ABC 18 1,050 666 5,198 1,040 57.8 
Error 471 3,199 3,977 16,013 2,211 48.1 
Total 95 * 8,982 8,888 40,910 MAO DLS = sais uis00 


146 d.f. for (y — bx)”. 
294 dif. for (y — bx)?. 


Nore. In reference (5.8), the sums of squares for AC and AB@ are in error. 


summary tables for further examination and for presentation of the 
results, we probably need only the AC two-way table. The averages 
for B (length of an individual treatment) should also be considered in 
case they indicate a trend effect that was not marked enough to attain 
significance. These data appear in table 5.17. 


TABLE 5.17 WeIGHTS OF DENERVATED MUSCLE, ADJUSTED FOR REGRUSSION ON 
WEIGHT OF NORMAL MUSCLE (unit = 0.01 gram) 
AC two-way table (s.e. +2.47) 
Number of treatments 


B means a a3 a, |Means 
b «64.4 
be 62.4 G | 56.0 59.1 65.2 | 60.1 
bs 61.4 Type of F|60.0 55.8 57.3 | 57.7 ae 
bs 64.7 current 60/63.2 63.9 68.6/65.2 %® +1: 
se. £1.43 25 | 64.5 71.8 73.2 | 69.8 
Means 60.9 62.6 66.1 | 63.2 


se. £1.24 
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The means have been adjusted in the usual way for the regression 
on the weight of the normal muscle. Because of this adjustment, every 
mean has a slightly different standard error. However, as pointed out 
in section 3.85, it is sufficiently accurate to use an average standard 
error. This is computed from the effective error m.s., which is 48.1 X 
1.019, or 49.0, as described in section 3.85. Thus the standard error for 


an entry in the two-way table is V49.0/8, or 2.47. 

The 25 cycle alternating current gave a significantly higher mean 
weight than any other type of current. The 60 cycle alternating current 
was superior to both galvanic and faradic currents, the last two not being 
significantly different. The weights increased as the number of treat- 
ment periods daily increased. On inspection the increase appears ap- 
proximately linearly related to the number of treatment periods. 

To test this supposition, we may fit a regression of the adjusted A 
totals on the number of treatment periods (the adjusted A means do not 
carry enough decimals to check with the analysis of variance). The 
relevant data are shown below. 


Adjusted — 2 = number 
A totals of periods 


ay 1950 1 
ag 2005 3 ¥ (a — ae — 2) = 421.667 
a 2115 6 > @-»? = 12.667 


By the usual formula, the contribution of the regression to the sum of 
squares for A is (421.667)?/(12.667) (32), or 439. ‘The last divisor, 32, is 
required because the A totals are totals over 32 rats. The sum of squares 
for regression accounts for practically all the sum of squares for A (441), 
so that there is no indication of deviation from linearity. The regression 
coefficient for a single rat is 421.667/ (12.667) (32) or 1.0 unit. The con- 
clusion is that, within the range of periods tested, each additional treat- 
ment period per day increased the weight by 1 unit. 

The B means in table 5.17 do not indicate any consistent effect of 
length of an individual treatment period (1 to 5 minutes). The major 
part of the AC interaction seems to come from the anomalous behavior 
of the faradic current, which produced a drop in weight from a; to ag 
and ag. Since the interaction as a whole was not significant, it does not 
seem worth while to examine the statistical significance of this effect. 


5.33 Other Designs. The total number of treatment combinations in- 
creases rapidly as the number of factors or the number of levels of a fac- 
tor is increased. In this event the latin square necessitates an amount 
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of replication that is usually impracticable, while it becomes difficult to 
assemble homogeneous replications for a randomized blocks arrange- 
ment. Consequently, the experimental error per unit tends to increase. 
In order to avoid this increase in error, a number of designs have been 
produced. The basic principle, which is common to all these designs, is 
the use of a “block” that is smaller than a complete replication. That is, 
the arrangements are such that the differences among these “incom- 
plete” blocks are eliminated from error, just as the differences among 
replicates are eliminated from error in a randomized blocks design. Un- 
fortunately, as will be explained later, this reduction in block size can be 
accomplished only by the deliberate sacrifice of accuracy on certain 
treatment comparisons. The designs fall into three main groups ac- 
cording to the particular treatment comparisons that are sacrificed in 
this way. 

In the first group, the treatment comparisons that are sacrificed are 
the high-order interactions. These designs are appropriate in lines of re- 
search where experience has shown that high-order interactions are 
nearly always negligible. The designs available in randomized incom- 
plete blocks are described in chapter 6 and those which can be placed in 
latin squares in chapter 8. 

In the second growp, known in agriculture as split-plot experiments, a 
factor, or a group of factors and their interactions, are sacrificed, These 
designs, which are very widely used, are discussed in chapter 7. Finally, 
in certain cases it is possible to construct designs so that the loss in ac- 
curacy is spread evenly over all factors and their interactions. Within 
this group there are several types: balanced lattices (chapter 10), bal- 
anced incomplete blocks (chapter 11) and balanced lattice squares 
(chapter 12). In cases where the effects of the factors and the sizes of the 
interactions are rather unpredictable, these designs may be suitable. 
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CHAPTER 6 


CONFOUNDING 


6.1 The Principle of Confounding 


6.11 The 2° Factorial Experiment with Complete Confounding. This 
chapter deals with designs where the size of block is reduced by the 
sacrifice of accuracy on certain high-order interactions. The method 
will be illustrated first for a 2° experiment with three factors, A, B, and 
C, each at two levels. Since there are only 8 treatment combinations, 
the replication is not particularly large and in practice this experiment 
would most frequently be arranged in ordinary randomized blocks or 
perhaps in an 8 X 8 latin square. The example is chosen because of its 
simplicity. 

The interaction of highest order is the ABC interaction. It will be 
recalled (section 5.23) that this interaction is estimated from the com- 
parison 


(abe) + (a) + (b) + (€) — (ab) — (ae) — (be) — (1) 


Suppose that each replicate in the experiment is divided into 2 blocks of 
4 units each, such that one block contains abc, a, b, and c, while the other 
contains ab, ac, be, and (1). With 3 replicates the plan (before ran- 
domization) would be as follows. 


TABLE 6.1 2° Experiment IN BLocks oF 4 units, wita ABC coNFOUNDED 


Rep. I Rep. II Rep. TI 
Block 1 2 3 4 5 6 
abe ab abe ab abe ab 
a ac a ac a ac 
b be b be b be 
c (1) c qd) c (1) 


There are two important properties of this plan. The total of blocks 

1, 3, and 5, minus the total of blocks 2, 4, and 6, is the ABC interaction 

total. Thus the ABC interaction is one of the components of the com- 
154 
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parisons amongst blocks. It is said to be completely confounded with 
blocks. Secondly, the other six factorial effects, A, B, C, AB, AC, and 
BC, will all be found to be orthogonal with the block totals. For in- 
stance, the AB interaction may be written (apart from the divisor) 


[(abe) + (c) — (a) — (b)] + [(ab) + A) — @e) — (be)] 


Of the 4 units in any block, two carry a (+) sign in this expression and 
two carry a (—) sign. Consequently, if we increase all the observations 
in a selected block by any amount, say 50, the estimate of AB remains 
unchanged, and similarly for the other 5 factorial effects. This means 
that these 6 factorial effects are not influenced by differences amongst: 
blocks. Various phrases are used to describe this property: the effects 
may be said to be unconfounded with blocks, or free from block effects, 
or to be composed entirely of within-block comparisons. 

Thus differences amongst blocks of 4 units are eliminated from the ex- 
perimental errors of the main effects and two-factor interactions, whereas 
with randomized blocks only differences amongst blocks of 8 units are 
eliminated, The reduction in effective block size is attained by making 
ABC the same as one of the block comparisons. There is no within- 
block information available about ABC. 

The degrees of freedom in the analysis of variance separate as follows. 


df. 

Blocks : 5 
A, B, CG, AB, AC, BC 6 
Error 12 
Total 23 


All sums of squares are calculated in the usual way, so that there is no 
complexity in the computations. 

The composition of the error s.s. is perhaps worth noting. Consider 
the 3 blocks (1, 3, and 5 in table 6.1) that contain the treatments abc, a, 
b, and c. These may be regarded as a randomized blocks experiment, 
with 4 treatments and 3 blocks. Consequently the interaction of treat- 
ments with blocks contains six components. Similarly in blocks 2, 4, 
and 6 the interaction of the other set of treatments with blocks provides 
six components. These two sets of six components constitute the error 
for the complete experiment. In other words, the error term in a con- 
founded factorial is made up of interactions between treatments and in- 
complete blocks. This remains true in the more complex designs that 
appear later in this chapter, even though in these cases the composition 
of the error is less easy to detect. 
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ABC does not appear explicitly in the analysis above. Actually, the 
experiment, if properly randomized, provides an estimate of error and a 
test of significance for ABC. In practice it is seldom worth while making 
this test, which is usually very insensitive. The test may, however, 
throw additional light on the nature of confounding. Let us ignore all 
other factorial effects and consider table 6.1 as the plan of an experiment 
to determine ABC, i.e., to determine the difference between the group 
of treatments abe, a, b, c and the group ab, ac, be, (1). From this point of 
view the block becomes the “experimental unit” and the experiment is 
of the ordinary randomized blocks type, having 2 “treatments” and 3 
replicates. The total s.s. among the 6 “units” is, of course, the blocks s.s. 
in the previous analysis. Thus the blocks s.s. may be subdivided into: 


df. 
Replicates 2 
ABC 1 
Error for ABC 2 
Total 5 


If confounding has been effective, the error for ABC, being composed of 
comparisons among blocks, will be larger than the error (with 12 d.f.) 
which applies to the rest of the experiment. Moreover, the error for 
ABC is estimated from only 2 d.f., so that the test of ABC is a poor one. 
The test might be worth making with a considerable number of experi- 
ments of the same type, where it is desired to examine the average ABC 
effect over the whole group. 


6.12 The 2° Factorial Experiment with Partial Confounding. Any of 
the seven factorial effects may be confounded with blocks in this way. 
The rest will then be free from block effects. These facts enable us to 
spread the confounding in an experiment among several factorial effects. 
A plan of this type is shown in table 6.2. 


TABLE 6.2 2° experiment IN BLOCKS oF 4 units, with ABC, AC, BC 
PARTIALLY CONFOUNDED 


Rep. I Rep, IL Rep. II 
Block 1 2 3 4 5 6 
abe ab abe ab abe ab 
a ac ac be be ac 
b be b a b 
c (1) qd) c (i) c 


ABC AC BC 
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The difference between the totals of blocks 1 and 2 represents the ABC 
interaction. In replication II, however, the block composition has been 
changed so that the difference between the block totals is the AC inter- 
action. In replication III the difference is the BC interaction. 

With this plan, A, B, C, and AB are entirely free from block effects. 
ABG is completely confounded with blocks in replicate I, but in the other 
two replicates the estimate of ABC is orthogonal with blocks. Thus a 
“within-block” estimate of ABC may be obtained from replicates IT and 
IU. Similarly, a “within-block” estimate of AC is available from rep- 
licates I and III, and one of BC from I and IL. These three effects are 
partially confounded with blocks, since an estimate that is free from 
block effects can still be made for each effect. Each “within-block” 
estimate is derived from 2 out of the 3 replicates in the experiment. The 
ratio 24 serves as a measure of the extent of the confounding. Yates 
(6.1) calls this ratio the relative information on the confounded inter- 
action. The ratio gives the amount of information available on the 
partially confounded effect, relative to that available on an uncon- 
founded effect. 

In the analysis of variance, all factorial effects appear. 


df. 

Blocks 5 
A 1 
EB 1 
Cc 1 
AB 1 
AC at: 
BC V 
ABC V 
Error il 
Total 23 


The sums of squares for blocks and for the unconfounded effects A, 
B, C, and AB are found in the usual way. The sum of squares for AC is 
calculated only from replicates I and III, the divisor for [ACP being 16 
instead of 24. Corresponding rules apply to BC and ABC. In the analy- 
sis of variance, the primes are inserted as a reminder that these effects 
are partially confounded and require special calculations. 


6.13 Confounding in the 2” Series. In order to use confounded de- 
signs intelligently, it is not necessary to understand in detail the methods 
by which they are constructed. The rules for the construction of con- 
founded 2” factorials will be briefly described, however, since they are 
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fairly simple and may help to explain the choice of plans presented at 
the end of this chapter. 

If a replicate is to be divided into 2 blocks, any factorial effect can be 
confounded with blocks. In most cases we would confound the highest- 
order interaction in all replicates. Suppose that each incomplete block 
is to be further divided into 2, so that there will be 4 blocks (I --- IV) 
in a replicate. Having chosen one factorial effect to represent the com- 
parison (I + II — III — IV), we may easily verify that any other fac- 
torial effect may be made to represent the comparison (I — II + III — 
IV). That is, to arrange a 2” factorial in blocks of 2"? units, we may 
confound any two factorial effects that we choose. 

The remaining block comparison (I — II — IIT + IV) represents a 
third factorial effect, which is also confounded with blocks. Barnard 
(6.2) has shown that this effect is uniquely determined by the two ef- 
fects that were chosen. It is always their “generalized interaction,” and 
is found by combining all the letters that appear in the two chosen fac- 
torial effects, and cancelling all letters that enter twice. Thus, if a 2° 
factorial is to be confounded in blocks of 2%, or 8, units, we might be in- 
clined to choose ABCDE and BCDE as the first two factorial effects to 
be confounded. The generalized interaction of these two effects is 
ABBCCDDEE or, after cancellation, is A. Consequently, if these two 
effects are chosen for confounding, the main effect of A is also automati- 
cally confounded. These results may be summarized as follows. “If a 
replicate of a 2” factorial is arranged in blocks of 2”~2 units, three fac- 
torial effects are confounded with blocks. Of these, two may be chosen 
at will: the third is their generalized interaction.” 

To proceed, a plan with 8 blocks in the replicate can be obtained from 
one with 4 blocks in the replicate by dividing each block into two parts. 
Tn making this division we may confound any factorial effect, except one 
of the three that is already confounded. Thus, in the 2° example, where 
ABCDE, BCDE, and A were confounded in the division into 4 blocks, 
we might choose say ACH for the division into 8 blocks. But with 8 
blocks in the replicate, there are seven components of the comparisons 
among blocks, and each of these must represent one factorial effect. 
Hence, in addition to the four factorial effects that are known to be con- 
founded, three others are confounded. These three are the generalized 
interactions of ACE with ABCDE, BCDE and A, or BD, ABD, and CE, 
respectively. 

The general rule should be clear from this case. Suppose that a rep- 
licate in a 2" factorial is to be divided into blocks of 2"—* units, so that 
there will be 2* blocks in the replicate. Then we may select any k fac- 
torial effects to be confounded, subject only to the restriction that none 
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of these must be a generalized interaction of any group of the others. A 
further (2* — & — 1) effects are automatically confounded. These are 
all the effects which can be expressed as generalized interactions of the 
group of effects selected for confounding. 

In the construction of designs for practical use, it follows that the ef- 
fects to be confounded must not be selected without examination of the 
other effects that are automatically confounded. If confounding is to be 
restricted to high-order interactions, we cannot choose ABCDE and 
BCDE in a 2° factorial, since A would then be confounded. In order to 
avoid confounding any main effects or first-order interactions in this case, 
a little trial will show that the only possible choice is two second-order 
interactions and one third-order interaction: for instance ABC, ADE, 
and their generalized interaction BCDE. The plans at the end of this 
chapter contain what appear to be the best choices for general use. 

When the interactions to be confounded have been selected, there re- 
mains the problem of writing out the plan showing the treatment com- 
binations that appear in each block. This may be done by the procedure 
used in the present discussion. The replicate is first divided into two by 
confounding one effect, then into four by confounding a second, and so 
on. ‘The rules which express each factorial effect in terms of the original 
treatment combinations (section 5.24) decide the composition of the 
blocks at each stage. For experiments with a large number of factors, 
arranged in blocks of small size, Fisher (6.3) has given an alternative 
method that is more expeditious [see also Finney (6.4)]. In this method 
a group of 2"—* Jetter combinations is constructed, such that the gen- 
eralized interaction of this group with any treatment combination gives 
the members of the block that contains this treatment combination. 

Fisher (6.3) has examined the general problem of confounding when 
the object is to keep all main effects and first-order interactions com- 
pletely clear of block effects. He has shown that experiments up to the 
2” factorial (128 treatment combinations) may be placed in blocks of 8 
units, while blocks of 16 units can accommodate experiments up to the 
25 factorial (32,768 treatment combinations). 

The preceding discussion applied to a single replication. With more 
than one replicate we may repeat the first replicate, in which case the 
confounding is complete for the interactions involved. Alternatively, 
the effects that are confounded may be changed from one replicate to 
the next, so that some intra-block information is available on all effects. 


6.14 Example of a 2* Factorial Confounded in Blocks of 8 Units. 
Table 6.3 contains the plan and yields of a 24 field experiment on beans 
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TABLE 6.3 PLAN AND YIELDS (BEANS IN POUNDS) OF A 2! FACTORIAL 


EXPERIMENT 

Rep. I Rep. IL 
p k d npk npk d Pp dnk 
Block a 45 55 53 36 Block a 43 42 39 34 
875 dnk dnp dpk n 351 n dnp k dpk 
41 48 55 42 47 52 50 Ad 
dp nk dk pk nk dp (1) np 
Block 6 50 44 43 51 Block b 43 52 57 39 
381 dnpk (1) dn np 395 pk dk  dnpk dn 
44 58 41 50 56 52 54 42 


conducted by the Rothamsted Experimental Station in 1936. The fac- 
tors were 


Dung (D): none, 10 tons per acre. 

Nitrochalk (N): none, 0.4 ewt. N per acre. 
Superphosphate (P): none, 0.6 ewt. PO; per acre. 
Muriate of potash (K): none, 1.0 ewt. KO per acre. 


With 16 treatment combinations in blocks of 8 plots, only one factorial 
effect is confounded in each replication. From the plan it will be seen 
that DNPK, the highest-order interaction, was confounded in each of 
the two replicates. The computations proceed as follows. 

Step 1. Calculate the totals for each treatment combination, the 
block and replicate totals, and the grand total. The total s.s. and the 
sums of squares for replicates and for blocks within replicates should 
present no difficulty. 

Step 2. This deals with the computation of the factorial effect totals. 
Write down each factorial effect total as a linear function of the treat- 
ment combination totals, as shown in table 6.4. The treatment com- 
binations and their total yields are placed in a systematic order in the 
first two columns, and the remaining columns give the factorial effects. 
Although the writing of the table requires some time, the procedure is 
quite simple. When the columns for D and N have been written down, 
the column for DN is obtained as the product of corresponding signs in 
the D and N columns. Similarly, later in the table DNP may be ob- 
tained as the product of the signs of D and NP, or of N and DP, and so 
on. Thus it is necessary to know only the expressions for the main ef- 
fects. 

The factorial effect totals are shown at the bottom of each column. 


Treatment 
combination 


ie) 
d 


n 
dn 


Pp 
dp 
np 


TABLE 6.4 CatcunaTION OF FACTORIAL EFFECT TOTALS IN A 24 EXPERIMENT 


Factorial effect 


DPK NPK DNPK 


yield D N DN 1a DP NP DNP K DK NK DNK PK 
115 - - + = + + + 4 t+ - = + 
be ie ey, ee ea Sg ee Se abr Se ae 
89 = i eee ae Oe eee mer 
iil bape ID Bl eRe ST eer dN ee ee a nr a fe 
Fa 9 a Al 2 a ee reer ane a eee ae a ge 
102 (ore Shee ce opel a a ee et Gal = oe 
89 Se a SES HS oe SE 
100 Ses Fe ed GH ope pe 
105 Siegen Wee Rep ee ete a eae ae ee 
95) Pore, = ek ee ES oe eee 
87 + - ~ ee eS 
(ie oe Oth AR toes, ee aS 
107 ee ee ee ee | 
9 + - =- + + = *= + ae ee ae 
eee ee ee ee 
8 + + t+ oF + + + F + F HF + + 
1502 —§ —102 82 14 88 50 8 12 —14 +82 18° 98 —32 50 


po 
) 
i) 


a9 
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The total for DNPK is not required, since owing to the confounding 
DNPK will not appear explicitly in the analysis of variance. The DNPK 
column, however, allows a check to be made, as follows. If the 15 totals 
are added, it may be verified by inspection of table 6.4 that their sum, 
+66, should equal 16 times the (dnpk) treatment total, minus the grand 
total, ice., 

16(98) — 1502 = +66 


Yates (6.1) gives a different method for obtaining the factorial effect 
totals. His method is completely automatic and does not require the 
expressions for the factorial effects to be written down. The present 
method, though probably not so speedy as Yates’s method, is more 
flexible in that only those factorial effects that are of interest need be 
obtained. Sometimes the experimenter will wish to isolate only the 
main effects and two-factor interactions, previous experience having 
shown that interactions of higher order are likely to be negligible. 

Step 8. The contribution of each factorial effect to the treatments 
8.8. in the analysis of variance is now obtained. Since there are 32 plots, 
the square of each effect is divided by 32. The analysis of variance ap- 
pears in table 6.5. Note that DN PK is omitted, since it already appears 


TABLE 6.5 ANALYSIS OF VARIANCE 


df. 8.8. m.s. 
Replications 1 3.1 
Blocks in reps. 2 123.2 61.6 
D 1 2.0 
N 1 325.1 ** 
fe 1 6.1 
K 1 4.5 
DN 1 32.0 
DP 1 242.0 ** 
NP 1 78.1 
DK 1 6.1 
NK 1 32.0 
PK 1 24.5 
DNP 1 2.0 
DNK 1 10.1 
DPK 1 15.1 
NPK 1 32.0 
Error 14 340.0 24.29 
Total 31 1277.9 


in the blocks s.s. As a check, the total of the 15 treatment s.s., 811.6, 
plus that for DNPK, 78.1, should equal the treatments s.s. as found in 
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the ordinary way from the totals for the 16 treatment combinations. 
The error s.s. is found by subtraction. 

Only two effects are significant: the main effect of nitrochalk, which 
produced a depression of the yield, and the DP interaction. Since 
neither D nor P produced significant main effects, it is advisable not to 
lay much stress on the DP interaction in the absence of other confirm- 
atory evidence. 

Step 4. The best presentation of the results will depend on their 
nature and on the audience for whom they are intended. Table 6.6 gives 


TABLE 6.6 DirreRENTIAL RESPONSES (POUNDS PER PLOT) 


Response with 


Mean 
Factor pees Dung (D) Nitrochalk (NV) | Super. (P) Mur. pot. (K) 


‘Abs. | Pres. | Abs. | Pres. | Abs. | Pres. | Abs. | Pres. 


Dung (D) Sb hives |toste se —2.5| +1.5'| —6.0] +5.0| +0.4 | —1.4 
Nitrochalk (VN) | —6.4 | —8.4] —4.4 |-.---0-]-esee0s -9.5 | —3.2| —4.4| -8.4 
Super. (P) +0.9 | —4.6| +6.4| —2.2 | +4.0].....0.)-...-- 0.9 | +2.6 
Mur. pot, (K) ~o.g | +0.1| —1.6| +1.2 | —2.8 | —2-5 | $1.0 ].......]esecnee 


s.e.: +£2.46 for differential response; +-1.74 for mean responses. 


a concise presentation of the main effects and two-factor interactions in a 
form that has been used frequently at Rothamsted Experimental Sta- 
tion. The response to each factor is shown separately for each level of 
every other factor. Thus the row labelled “Dung”’ contains the mean 
response and the differential responses to dung. The figure —2.5 (re- 
sponse to dung with nitrochalk absent) is the average response to dung 
over all plots that did not receive nitrochalk. The table enables a quick 
appraisal to be made of the nature of the main effects and two-factor 
interactions. In the present case, of course, the table is scarcely needed 
because of the dearth of effects. 

The data in the table are easily calculated from the factorial effect 
totals in table 6.4. For instance, 


Mean response to dung = 7'g[D] = —f, = —0.5 
Mean response to dung 

(nitrochalk absent) = ’5{[D] — [DN]} = —4#8 = —25 
Mean response to dung 

(nitrochalk present) = 5 {(D]+ [DN]} = +76 = +15 
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These relations may be verified from the signs in the columns in table 
6.4. 

The standard error per plot is V 24.29, or 4.93. Since each differential 
response in table 6.6 is the difference between the means of two groups 
of 8 plots, its standard error is (4.93)/2, or 2.46. The standard error for 


& mean response is Vi times this value, or 1.74. 


6.15 Confounding of a 3” Factorial. Let the symbol ij denote the 
treatment combination that has the ith level of A and the jth level of 
B (i,j = 0, 1, 2). As before, we wish to keep the main effects clear of 
block effects, that is, to confound only the AB interaction. Since there 
are 9 treatment combinations in a replicate and since 3 is the only factor 
of 9, the size of the incomplete block must be 3 units. 

We have seen (section 5.26) that the main effects of A and B both con- 
tain two components, while the AB interaction has four components. 
Further, the main effects and the interaction can be divided into single 
components in various ways, and the division that is most appropriate 
for interpretation of the results will change from experiment to experi- 
ment. This suggests that the particular components of the interaction 
which we should desire to confound will vary from case to case. Actu- 
ally, the possibilities for confounding are much more restricted with the 
3" than with the 2” system: in fact, only one set of components of AB 
lends itself readily to confounding. 

The confounding is based on the properties of the 3 X 3 graeco-latin 
square. Suppose that the 9 treatment combinations are set out as in table 
6.7, on which a 3 X 3 square is superimposed. Following the notation 


TABLE 6.7 Use or A 3 X 3 GRAECO-LATIN SQUARE TO OBTAIN THE AB 
INTERACTION 


bo by be 


a | (00)J1 (01)IoJ2 (02)I37s 
a (10) IgJ2 ()hJ3 a ae 
a2, (20) 273 QU IgJ1 (22) J 


used by Yates (6.1), we denote the latin letters in the square by J, Io, 
and J, respectively, and the greek letters by J, Jo, and J. 3, respectively. 
It is clear that comparisons among the row totals of the square give 


the two components of the main effect of A, while comparisons among 
the column totals give the main effect of B. Now consider the J totals: 


I, = (00) + (11) + 22); Ip = (01) + (12) + (20); 
ZI = (02) + (10) + (21) 
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By the basic property of the latin square, comparisons among these 
totals are orthogonal to both rows and columns: that is, to the main ef- 
fects of A and B. Hence, the comparisons among the J totals must 
represent two of the four components of the AB interaction. The same 
argument shows that the two remaining components of AB are derived 
from the comparisons among the J totals, where 


Ji = (00) + 02) + 21); Jag = (01) + 0) + 22); 
Jz = (02) + (11) + (20) 


‘A numerical verification of this fact was made at the end of section 5.26, 
where the sum of squares for AB was calculated from the latin and 
greek letter totals which correspond to the T and J totals. 

The application of this result to confounding is shown in table 6.8. 
There are two possibilities. In plan (i) the J components of AB are com- 
pletely confounded with incomplete blocks, since the block totals have 
been made the same as the J totals. The main effects and the J compo- 
nents of AB are unconfounded. In plan (ii), the J components of AB are 


TABLE 6.8 3 X 3 ExpertMent with AB PARTIALLY CONFOUNDED 


Plan (i) Plan (ii) 
Incomplete blocks Incomplete blocks 
@ i) ii) (i) ii) Gii) 
(00) (01) (02) (00) (01) (02) 
(11) (12) (10) (12) (10) (11) 
(22) (20) (21) (21) (22) (20) 
i Te T3 Jy J Js 
I components confounded J components confounded 


completely confounded. It should be noted that neither the J nor the 
J components are easy to interpret in terms of the results of an actual ex- 
periment. They are selected for confounding because they are con- 
venient for this purpose. ‘ 
If the number of replicates in an experiment is even, we may use plan 
(i) in half the replicates and plan (ii) in the other half. With this ar- 
rangement the J components may be estimated (clear of block effects) 
from those replicates in which plan (ii) is used, and vice versa for the J 
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components. The relative accuracy is 14 for each set of components. 
Tt may be shown that the relative accuracy is also Y for any component 
of AB in which we may be interested. The confounding is said to be 
balanced with respect to AB. Balanced confounding is preferable, for 
there is no reason to confound J more heayily than J. 

This design has limited practical utility. It may be used where the 
factors are known to operate independently, so that AB will be neg- 
ligible. If information about AB is wanted, a randomized blocks design 
is advisable unless there will be a substantial reduction in error variance 
from the use of incomplete blocks. The same principle of construction, 
however, leads to a very useful design for the 3° factorial. 


6.16 Confounding of a 3° Factorial. The same notation will be used. 
Thus, (021) denotes the treatment combination aob2c;. Since there are 
27 treatment combinations, the possible sizes of incomplete block are 
3 and 9 units. The plan for blocks of 3 units necessitates confounding of 
the two-factor interactions and is not given here. With 9 units per 
block, only ABC need be confounded. In the case of the 32 factorial we 
were able to divide the 9 treatment combinations into groups of three 
(the J and J groups), such that comparisons among the group totals 
gave the components of AB. We shall show that the 27 treatment com- 
binations in a 3° factorial can be divided into groups of nine such that 
comparisons among the group totals give the components of ABC, Each 
set of three groups will contribute two components of ABC. Since ABC 
has in all eight components or degrees of freedom, there will be four such 
sets. 

The AB interaction can be calculated separately for each level of C. 
As before, we will obtain the AB interaction from its I and J compo- 
nents. Table 6.9 shows the J components for each level of C. 


TABLE 6.9 Tun J components or AB SHOWN FOR BACH LEVEL or C 


co it C2 


Ti | (000) + (110) + (220) | (001) + (111) + (221) | (002) + (112) + (222) 
Zz | (O10) ++ (120) + (200) | (O11) + (121) + (201) | (012) + (122) + (202) 
Ts | (020) + (100) + (210) | (021) + (101) + (211) | (22) + (02) + (212) 


We may regard table 6.9 as a3 X 3 table, in which each entry is the 
total of 3 treatment combinations. The row totals of the table give the [ 
components of AB, while the column totals give the main effect of C. 
Further, we may take J and J totals from this table just as in table 6.7. 
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These totals will be called J — I, I — Jj, ete., since they come from the 
I components of AB. Thus 
I — Ix: (000) + (110) + (220) + (011) + (121) + (201) + (022) + (102) + (212) 
I — I»: (001) + (111) + (221) + (012) + (122) + (202) + (020) + (100) + (210) 
I — Ig: (002) + (112) + (222) + (010) + (120) + (200) + (021) + (101) + (211) 


By the same argument as for the 3 X 3 design, the comparisons among 
these totals of 9 treatment combinations must represent two of the com- 
ponents of the interaction of AB with C; that is, of the ABC interaction. 

Consequently if we put all treatment combinations in J — J, into the 
first block, those in I — Jz into the second, and those in J — J; into the 
third, we have a plan which completely confounds two of the eight com- 
ponents of ABC, and leaves all other factorial effects unconfounded. 
This arrangement appears as replication III in plan 6.7 for the 3° fac- 
torial at the end of this chapter. 

The second set of three groups of 9 treatment combinations is obtained 
by taking the J totals from table 6.9. 


I — Jy: (000) + (110) + (220) + (012) + (122) + (202) + (021) + (101) + (211) 
I — Ja: (001) + (111) + (221) + (010) + (120) + (200) + (022) + (102) + (212) 
I — Js: (002) + (112) + (222) + (O11) + (121) + (201) + (020) + (100) + (210) 


This grouping constitutes replication IV in plan 6.7. The remaining 
sets are obtained by forming a 3 X 3 table similar to table 6.9 for the J 
components of AB at each level of C. The reader may verify that the 
J —I,, J — Io, and J — Iz groups are as shown in replicate I and the 
J —J1,J —Jo, and J — Jz groups in replicate I of plan 6.7. 


6.17 Example of a 3° Factorial Confounded in Blocks of 9 Units. 
This experiment, conducted by the Seed Laboratory, Iowa State Col- 
lege, tests the effects of 3 levels of nitrogen, 3 of phosphorus, and 3 of 
potash on the germination of lettuce seedlings. The seed was thoroughly 
mixed, and divided into 108 samples of about 60 seeds each. Each 
sample was planted in a copper box, 6 in. square and 144 in. deep, ina . 
mixture of soil and sand. The boxes were placed in a germinator, at a 
temperature of about 32°C. At the end of 5 to 7 days the seedlings 
were classified as normal, abnormal, hard, or dead. The data in table 
6.10 show the numbers of normal lettuce plants. 

There were 4 replications, each placed on a different shelf in the ger- 
minator. On a shelf the boxes were placed in 3 columns of 9 boxes each, 
each column being an incomplete block. It will be noted that all 3 fer- 
tilizers had a deleterious effect on emergence. 
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The computations proceed as follows. 
Step 1. Form the block and replicate totals and the grand total 


(shown in table 6.10) and also the totals for each treatment combination 
and the three two-way tables (shown in table 6.11). 


Step 2. These data enable us to calculate the total s.s. and the sum 
of squares for replications, blocks within replications, and for the N, 
P, K, NP, NK, and PK factorial effects. All are obtained in the usual 
way and are entered in the preliminary analysis of variance (table 6.12). 


TABLE 6.12 ANALYSIS OF VARIANCE FoR A 3 X 3 X 3 FACTORIAL 


af. 8.8. m.s. 
Replications 3 2,041.88 
Blocks within replications 8 5,008.15 626.02 
N 2 1,016.67 508.34 ** 
i. 2 917.39 458.70 ** 
K 2 293.39 146.70 
NP 4 399.27 99.82 
NK 4 589.61 147.40 
PK 4 212.89 53.22 
NPK: confounded in replications 
i 2 25.21 12.60 
2 2’ 64.22 32.11 
3 2’ 6.39 3.20 
4 2° 198.30 99.15 
Error 70 4,146.88 59.24 
Total 107 14,920.25 


Subdivision of part of the treatments s.s. 


Ns 2 1 1,012.50 ‘1,012.50 ** 
Q 1 4.17 4.17 
P: OL 1 917.35 917.35 ** 
Q 1 0.04 0.04 
Ke oD 1 284.01 284.01 * 
Q 1 9.37 9.37 
NP: LXL 1 184.08 184.08 
Rest 3 215.19 71.73 
NK: LXL 1 256.69 256.69 * 
Rest 3 332.92 110.97 
NPK:LXLXL 1’ 59.12 59.12 


Notice that we do not calculate the total s.s. for treatments (with 26 
d.f.), since this contains part of the blocks s.s. 
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Step 3. There remains the calculation of the contribution from the 
NPK interactions. Before doing this, it may be remarked that some- 
times, from the nature of the factors or from previous experience, there 
is good reason to believe that the three-factor interactions will be neg- 
ligible. In this case the experimenter may decide to pool the sum of 
squares for NPK with the error s.s. without troubling to compute the 
sum of squares for NPK. The pooled error would have 78 d.f. and would 
be obtained by subtraction of the sum of squares calculated in step 2 
from the total ss. The danger from this procedure is that if three- 
factor interactions are present the experimenter may not detect them, 
and moreover his estimate of error will be inflated. For many types 
of field experimentation in agriculture the procedure seems reasonably 
safe. When there is doubt it is better to isolate NPK. 

Consider the two components of NPK that are confounded in rep- 
lication 1. The contribution of these components to the sum of squares 
for NPK must be calculated from the remaining 3 replicates, in which 
they are unconfounded with blocks. The totals needed are shown in 
table 6.11 under the heading “NPK components.” From the treat- 
ment totals, compute the total (944) of the 9 treatment combinations 
that appear in block 1A. Thus 


Block 1A = 012 + 122 + 220 + 202 + 101 + 021 + 000 + 110 + 211 
944= 119+ 81+ 924+ 96+ 82+ 99+ 171+ 118+ 86 


In the same way we obtain 1102 for the total over all treatments that 
appear in block 1B and 1131 for block 1C. Underneath these figures 
we place the respective totals for blocks 14, 1B, and 1C. By subtraction 
we obtain the totals 773, 748, and 737. These are the totals of the groups 
of treatment combinations taken over replications 2, 3, and 4. The sum 
of squares of deviations of these quantities from their mean is divided 
by 27, since each figure contains 27 observations, The result, 25.21, is 
the contribution of the two components of NPK to the sum of squares 
for NPK. The remaining six components are found similarly from rep- 
lications 2, 3, and 4. Of course, all eight components could be computed 


in one step as 
aiy[(778)?+ (748)?+ - - - + (701)?-+(771)?] —gip[(2258)?+ - - - + (2274)?] 
The error s.s., with 70 d.f., is calculated by subtraction. 


Step 4. In an experiment of this type, where each factor has equally 
spaced levels of an ingredient, it is usually advisable to examine the linear 
and quadratic components of the response curves. The contributions 
to the treatments s.s. are displayed in the lower section of table 6.12. 
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The method of calculation was described in section 5.26, where the data 
for N and P in this experiment were used as an example. All three fer- 
tilizers show significant linear responses, with no indication of any de- 
parture from linearity. 

With regard to the two-factor interactions, the “linear by linear” com- 
ponents have been isolated for NP and NK (see section 5.26 for the 
method). For PK it was not thought worth while to do this, since the 
total s.s. for PK, 212.89, is not large enough to allow any single compo- 
nent to be significant. The “linear by linear” component is significant, 
for NK but not for NP. 

Where the “linear by linear” components of the two-factor interactions 
are large, it is desirable to isolate the “linear by linear by linear’’ compo- 
nent of the three-factor interaction. This will be done as an exercise. 
First we must define this component. With two factors, say N and P, 
the N;Pz component has been defined as 

(22) + (00) — (20) — (02) 
where the numbers refer to the levels of N and P. The value of this 
quantity at the highest level of K, minus its value at the lowest level of 
K, measures the linear effect. of K on NV. 1Py, and is the N,P,Kz compo- 
nent in question. Algebraically, it is 
TL = (222) + (200) + (020) + (002) ~ (022) — (202) — (220) — (000) 
The estimate from the treatment totals will be found to be —27. 

Since three-factor interactions are partially confounded with blocks, 
this component js also partially confounded and must be adjusted so as 
to remove block effects. The nature of the confounding is seen by writing 
down the expression above in the original table of plot yields (table 6.10). 
This has been done by means of the + and — signs that appear to the 
left of the seedling numbers in the table. It is evident that L is orthog- 
onal to the totals of blocks 1C, 2B, 3C, and 4C, since each of these 
blocks has one + and one —. In blocks 1B, 2C, 3B, and 4B, L has three 
+ signs, while in the remaining blocks it has three — signs. Hence, a 
quantity that is free from block effects is 

3L — (1B) — (2C) — (8B) — (4B) + (14) + (2A) + (84) + (44) 
or 
3(—27) — 354 — 134 ~ 232 — 290 + 171 + 308 + 197 + 302 = —113 
the block symbols denoting block totals. "This expression, apart from a 
divisor, may be shown to be the least squares estimate of NiP Ky, ad- 
justed for block differences. 

If this quantity is expressed algebraically ag a linear function of the 
plot yields, the sum of squares of the coefficients is found to be 216, The 
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contribution of N;PzKy, to the treatments s.s. is therefore (118)?/216, 
or 59.12, which is practically the same as the error m.s. If the number 
of replicates differs from four, the procedure for isolating the “linear by 
linear by linear” component remains the same, except that the divisor for 
the square becomes 54r instead of 216. 


Step 5. This concerns the presentation of the results, Usually it 
will be sufficient to show the three two-way tables of means, which are 
derivable from the two-way tables of totals (in table 6.11) on division 
by 12. Since all main effects and two-factor interactions are uncon- 
founded, standard errors and t-tests for the 3 X 3 tables are obtained 
just as in a randomized blocks experiment. ‘The principal results are 
that each fertilizer has produced a significant decrease in the numbers 
of seedlings that emerged, the decrease being substantially proportional 
to the amount of dressing. The significant NK, interaction represents 
the fact that the decrease in emergence from ng to no was smaller at the 
kg level than at the ko level. There is some indication of a similar effect 
with V and P, though this did not attain significance. : 

The individual 27 treatment totals or means cannot be used as they 
stand for interpretative purposes, since they contain some block effects, 
A table of these totals or means will probably be unnecessary unless some 
aspect of the three-factor interaction requires study. To obtain such a 
table, we adjust each total so as to remove block effects. Each block 
effect is first estimated. For this purpose we do not use simply the ob- 
served block mean, since that in turn contains treatment effects. The 
least squares estimate of any block effect is 


z7l4(block total) — (total of treatments appearing in the block)] 


The basic data needed are available in the section headed “NPK com- 
ponents” in table 6.11. Thus, for block 14, the estimated effect is 


wy[4(171) — (944)] = —9.6 
The block effects are given below. 


Block 
Replication A B Cc 
1 o.6 +11.6 +16.5 
2 +4,2 —4.0 15.1 
3 —8.8 —5.4 —5.4 
4 +3.9 +6.3 +6.0 


To adjust any treatment total, we note from the plan the 4 blocks in 
which it appears, and compute the sum of the effects for these 4 blocks. 
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This quantity is subtracted from the unadjusted total to give the ad- 
justed total. Thus for npiko, which appears in blocks 14, 2A, 3B, and 
4B, the adjusted total is 


118 — [—9.6 + 4.2 — 5.4 + 6.3] = 122.5 


To obtain the adjusted mean we divide by 4 as usual. 


6.18 Mixed Series: the 3 x 3 X 2 Factorial. In the designs discussed 
in previous sections, all factors have the same number of levels. When 
the number of levels differs from one factor to another, the utility of 
confounding is limited. Usually some two-factor interactions must be 
confounded, and the computations become more laborious. Only five 
designs of this type are given at the end of this chapter, though a number 
of others are available in the literature. As an example, we will consider 
the confounding of a3 X 3 & 2 factorial, which has 18 treatment com- 
binations. 

The main effects of a factor will be kept clear of block effects if every 
block contains each level of the factor the same number of times. Thus 
the factor A, which occurs at 3 levels (0, 1, 2), is unconfounded if every 
block contains an equal number of O's, 1’s, and 2’s. Consequently, block 
size has to be a multiple of 3. Similarly we can keep the main effects of 
B (0, 1, 2) unconfounded if the block size is a multiple of 3. For the 
factor C at 2 levels (0, 1), the block size must be a multiple of 2. Hence, 
in order to keep all main effects clear of blocks, the only feasible block 
size is 6. 

Further, with 6 units in a block, every possible combination of the 
levels of A and those of C can appear once in every block, so that we may 
expect to be able to keep AC, and likewise BC, wnconfounded. We can- 
not place all the 9 combinations for AB in a block, so that AB will be 
partially confounded. 

From this approach the plan can be constructed rather easily. Every 
block is to contain the 6 combinations (00), (10), (20), (01), (11), (21) 
of Band C. The only question is the manner in which the 3 levels of A 
are combined with the 6 pairs above. This allocation is to be such that 
all 6 combinations of AC appear in the block. That is, the 0, 1, and 2 
levels of A must each appear with the 0 level of @ and each with the 1 
level of C. We may impose one additional rule, designed to confound 
AB as little as possible. Although we cannot represent all 9 combina- 
tions of AB in a block, it will be well to represent as many as possible. 
Thus we introduce the restriction that any AB combination (e.g., 12) 
must not appear more than once in a block. 

Under these rules only four types of replicate can be made up, as shown 
in table 6.18. 
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Consider, for instance, in how many ways the block containing (000) 
-can be constructed. The first three A levels (reading from the top in 
table 6.13) must be either 0, 1, 2 or 0, 2, 1, so that each A level shall ap- 
pear with the 0 level of C. If 0, 1, 2 is chosen, the remaining three A 


TABLE 6.13 Possrpue BLocks For A 3 X 3 X 2 FACTORIAL 


Level of A 


BC] Ia Ib Ie | Wa Wb We | Wa Ib Wie} IVa IVb IVe 


NerONrF SO 
HRROOCO 
FONCNH 
NrOorFOCNW 
ovnenwro 
CNrRrRONW 
rPOoNNFO 
Nr OCOONF 
ornNnNNO- 
FNoorD, 
NOoOrFrRNO 
Nor OrN 
CorNeFENO 
BENONOH 


levels must be either 1, 2, 0 or 2, 0, 1, these being the only sequences that, 
make no AB combination appear twice. These sequences give blocks 
Ic and IIb respectively. Similarly, the choice of 0, 2, 1 for the first three 
A’s leads only to blocks III¢ and IVb. 

In the same way, we find that there are only four possible blocks con- 
taining the treatment combination (001) and four containing (002.) 
The three sets of 4 blocks can be grouped into 4 separate replications 
(I, IL, III, and TV in table 6.13). 

As Yates (6.1) has shown, the plan can be written in a more con- 
densed form (table 6.14) which exhibits the nature of the confounding. 


TABLE 6.14 ConpENSED FORM OF THE PLAN FOR A 3 X 3 X 2 CONFOUNDED 
FACTORIAL 


Blocks 


Ig Te Io | Ile Ue IW | Tilo Tt, I, | Ve IVs IVe 


0 Ie Ig Ty | Ip Ty “lo | Jag Js Jy | Js Jr Je 
at Tg) Eq. FIGs ds |S a Fa) Sa so 


For example, in block I, the 3 combinations that appear with the 
zero level of C are (01), (12), and (20). This is the group that forms the 
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Iz component of AB (see section 6.15). Tt follows that in the first 2 
replicates the J component of AB is partially confounded; the J compo- 
nent can be verified to be unconfounded. In replicates III and IV the 
J component is partially confounded, while the J component is clear of 
blocks. ABC is partially confounded in all replicates. The relative in- 
formation on AB has been shown to be 7¢ and that on ABC, 54. 


6.19 Numerical Example of the 3 X 3 X 2 Design. Table 6.15 shows 
the plan and analysis of variance for an experiment on the response of 
young tung trees to fertilizers. The results of this experiment have been 
reported by Merrill, Kilby, and Greer (6.17). The general object was to 


TABLE 6.15 Dara ror 3 X 3 X 2 FACTORIAL EXPERIMENT IN BLOCKS OF 6 UNITS 


la Ib Te Ila IIb Ile 
100: 80 200: 78 000: 38 200:136 000: 43 100: 89 
210: 86 010: 55 110: 73 010: 56 110: 81 210: 87 
020: 70 120: 82 220: 75 120: 64 220: 90 020: 66 
201: 74 O01: 67 101: 78 101: 95 201: 81 001: 91 
Oll: 82 111: 67 211: 51 211: 76 O11: 61 111: 97 
121: 86 221: 57 021: 66 021: 71 121: 65 221: 60 
478 406 381 1,265 498 421 490 1,409 
Illa IIb Iie IVa IVb IVe 
100: 86 200: 73 000: 66 200: 88 000: 53 100: 81 
010: 79 110: 76 210: 85 110:107 210: 66 010: 58 
220: 73 020: 97 120:101 020: 70 120: 92 220: 56 
201: 97 001:116 101:117 101: 79 201: 88 001: 90 
111: 79 211: 86 011:106 O11: 92 111:109 211: 68 
021:1138 121: 81 221:102 221: 96 021: 95 121: 67 
527 529 577 1,633 532 503 420 1,455 


Analysis of variance 


Source d.f. 8.8. m5, 
Replications 3 3,837 1,279 
Blocks within replications 8 2,836 354 
Levels of 8-8-6 in row (A) 2 1,116 558 
Meals (B) 2 254 127 
Levels of 8-8-6 side-dressed (C’) 1 868 868 * 
8-8-6 in row X meals (AB) 4’ 1,129 282 
8-8-6 in row X 8-8-6 side-dressed (AC) 2 2,995 1,498 ** 
Meals by 8-8-6 side-dressed (BC) 2 424 212 
8-8-6 in row X meals X 8-8-6 side-dressed (A BC) 4 1,016 254 
Error 43 8,909 207 
Total 71 28,384 
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discover the type of fertilizer application that would stimulate the early 
growth so that the young trees would be ready for transplanting to com- 
mercial orchards by early autumn. The data shown are the mean heights 
in centimeters of the 12 tung trees on each plot. For convenience in 
following the computations, the plot yields follow the same order as in 
table 6.13, Note that factor A appears last in table 6.13 and first in 
table 6.15. 
The three factors were as follows, all dressings being per acre. 


A B Cc 
8-8-6 fertilizer applied 8-8-6 fertilizer applied 
in the row as side-dressing 
0 None 0 None 0 None 
1 200 lb. 1 Tung meal (650 Ib.) 1 200 lb. 
2 400 Ib. 2 Cottonseed meal (650 lb.) 


The side-dressing was placed in a shallow furrow on one side of the 
row about 8 inches from the tung seedlings. Factors A and B were ap- 
plied at planting (March 13), factor C on June 3, and the height measure- 
ments were made on October 23. It should be noted that all three fac- 
tors supply each of the common plant nutrients (V, P, and K). 

In an experiment of this type, one might anticipate that two-factor 
interactions will be present. If, for example, 400 Ib. of fertilizer applied 
in the row supply all the nutrients that the young tree can utilize, plots 
that receive this dressing will show no response to either factor B or C, 
On the other hand, plots that receive the zero level of A may respond to 
the dressings supplied in factors B and C. It would not be wise to con- 
found any two-factor interactions heavily. Since, however, the soils 
available for experiments had been found to be very variable, it was de- 
cided to arrange the experiment in blocks of 6 plots. As we have seen, 
this will partially confound AB, though as the relative information is 
high (7g), not much information is lost on this comparison even if the 
incomplete blocks prove ineffective. 

Table 6.16 shows the computation sheet leading to the analysis of 
variance. The steps are as follows. 

i. Calculate the block totals (denoted by Ia, Ib, ete., and shown in 
table 6.15), and the treatment totals (table 6.16). Calculate the total 
s.s. and the blocks s.s. The latter has been divided into two com- 
ponents—replications (3 df.) and blocks within replications (8 d.f.). 
This separation enables us to estimate the decrease in error variance due 
to the reduction in block size from 18 to 6 plots. The analysis of variance 
appears at the foot of table 6.15. 

ii. Form the two three-way totals as shown. From the margins of 
the table for (c, + ¢o), calculate the sum of squares for the main effects 
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TABLE 6.16 Computation sHenT ror 3 X 3 X 2 EXPERIMENT, BLOCKS oF 
6 UNITS 
(i) Treatment totals 
co ey 


bo by be bo br be 
a 200 248 303 364 341 345 
a 336 337 339 369 352 299 
a2 375 324 294 340 281 315 
(ii) Two-way tables 
a1 +c C1 — Co 


bo by be Totals bo by be Totals 
a) | 564(1)  589(4) 648(7) «1,801 | +164(1) +93(4) +42(7) 299 
a, | 705(2) 689(5) — 638(8) 2,032 | + 33(2) +15(5) —40(8) 8 
ay 715(3) 605(6) 609(9) 1,929 | — 35(8) —48(6) +21(9) —57 


Totals| 1,984 1,883 1,805 5,762| 162 65 23 250 
(iii) From (¢1 + eo) 
Ti = (1) + (5) + (9) = 1,862 Ji = (1) + (6) + (8) = 1,807 
Iq = (2) + (6) + (7) = 1,958 Jo = (2) + (4) + (9) = 1,903 
Ts = (8) + (4) + (8) = 1,942 Js = (3) + (5) + (7) = 2,052 
5,762 5,762 
20’ = 2h, +Ja+Ia = 4,700 Qi! = 2, +a +1Va = 4,673 
2Ie' = 2a +1, +1, = 4,743 Qe! = 2Jo +1 +1V, = 4,838 
21s’ = 203 +1, +11. = 4,755 Qs = 2J3+ II, +1Ve = 5,101 
14,198 14,612 
2 tila 2 2 2 
RUM ten AR = (4,700)? + ent + (5,101) “ (14,198) en = 1,129 


(iv) From (e1 — ¢9) 
Ri = +200, Ro = +32, Rs = +18; Si = +81, Se = +147, S3 = +22 
where, e.g., Ry = (1) + (5) + (9), ---, Ss = (8) + (5) + (7) 


2Ry = 2k; —l, +1. + Ty — I, = +306 
2Ry! = 2R2 + 1a — I. — Ua + Il, = +153 


2Rs’ = 2R3 — Ta +15 + Ua — Ty = + 41 
+500 
287’ = 2S; — Illy + II, + IV, — IV, = +293 
2Se! = 282 + IIa — IW, — IVa + IV, = +132 


283’ = 283 — IIa + WI, + IVa —IVe = + 75 


+500 


(4-806)? ++ (4153)? +--+ (475)? _ (4-500)? 


s.s. for ABC = a 0 


= 1,016 
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of A and B. From the total of the table for (cy — co), we calculate the 
sum of squares for the main effect of C: this is (250)?/72, since there are 
72 plots. Similarly, the margins of the table provide the sum of squares 
for the AC and BC interactions, which being unconfounded are obtained 
in the usual way. 


iii. These steps lead to the sum of squares for AB. First calculate the 
I and J components of AB, which are obtained by summation from the 
(c; + ¢o) table in step ii, as shown in table 6.16 (iii). Then apply the ad- 
justments for block effects. Since J, for instance, does not appear in 
blocks Iq or IIa, the adjustments involve the totals for these two blocks. 
The adjustments lead to the quantities 2/,’, ete., from which the sum of 
squares for AB follows as shown. 


iv. These steps lead to the sum of squares for ABC. From the table 
for (cy — ¢o), step ii, form totals R; and S; which are calculated by the 
same rules as for the J’s and J’s, respectively. These totals are then ad- 
justed for block effects, yielding quantities 2R;’, 28,’, etc., from which 
the ABC s.s. is easily obtained. The error s.s. is then computed by sub- 
traction. 


vy. Summary and presentation of results. Consider first the con- 
clusions indicated by the analysis of variance. The only significant mean 
square for a main effect is that for C (side-dressing). The AC interaction 
is significant at the 1% level, while the mean square for A, though not: 
significant, is substantially above the error m.s. Neither B (meals) nor 
any of its interactions approaches significance. These results suggest 
that the AC two-way table should be examined. 

Since table 6.17 contains no confounded effects, the means per plot 
are found simply by addition and division from table 6.16, (i), It is 


TABLE 6.17 AC two-way TABLE 


Mean heights (centimeters) + 4.15 
Row dressing pide ireee Ds 
co C1 
None 200 Ib. 
a None 62.6 87.5 
a 200 lb. 84.3 85.0 
az 400 Ib. 82.8 78.0 
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evident that there are large and significant responses to the 200 Ib. dress- 
ings, both when applied in the row and as side-dressing. The means, 
84.3 for the row application and 87.5 for the side-dressing, do not differ 
significantly, so that the applications appear to have been about equally 
effective. There is no further increase to the 400 Ib. row dressing, or to 
mixtures containing both types of dressing. Thus the large AC inter- 
action arises primarily because each dressing is effective only in the 
absence of the other. 

The analysis of variance would indicate that no definite effects of the 
meals (B) can be established. As an exercise, the AB two-way table, 
which involves partially confounded effects, will be examined. The 
means must be adjusted for block effects. This could be done by first 
calculating a mean for each incomplete block, adjusted for treatment 
effects, and then adjusting each AB mean for the 4 blocks in which it 
lies. It is slightly quicker to perform the adjustments by means of the 
I and J effects, as follows. 

In table 6.16, (iii), divide the quantities 27,’ , ete., by 42, and calculate 
the deviations from their mean. The quantities 2J,’, etc., are treated 
similarly. The same procedure is carried out for the I’s and J ’s, except 
that. the divisor is 24. This gives 
Y= 02 W=905 f= —47, fol =—08, f= 54 
n= —24 n= 16 = 09 f=-47, fp=—07, fe 55 
Sa = +18, t= —14, th=—-04 d= 00, d= -01, 3 = —0.1 

The unadjusted mean of (agbo) is 70.5. From table 6.16, (ii) and (iii), 
we see that (aobo) belongs to I, and J;. Hence its adjusted value is 


(obo)’ = (aobo) + 841 + 5/1 = 70.5 +16 +0.0 = 72.1 


By the same principle, the adjustment to (a9b;), for instance, is 
* (82 + 91). The complete two-way table is shown below. 


iY = -08, i 


TABLE 6.18 AB two-way TABLE, ADJUSTED FOR BLOCK EFFECTS 


Mean heights (centimeters) 


Row dressing 


bo by be 
None Tung Cottonseed 
ay None 72.1 73.1 79.5 
a 200 lb. 86.6 87.6 79.4 
az 400 Ib. 88.9 74.2 77.6 
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The standard error of a single figure in the table is V 207/8, or 5.09, 
since each figure is the mean of 8 plots. This standard error applies to 
t-tests of unconfounded effects; i.e., of the main effects of A and B. For 
t-tests of components of the AB Eeraoiier! the figure must be multiplied 
by V8/7, or 1.069, the value 8/7 being the inverse of the relative effi- 
ciency on AB, 

For ¢-tests of quantities that are a mixture of main effects and inter- 
actions, the multiplier lies between 1.069 and 1. It will usually be 
sufficiently accurate to use the multiplier 1.069. If desired, the correct, 
standard error may be obtained by means of a rule which will be given 
without proof. The quantity whose standard error is wanted is expressed 
in terms of components of main effects and interactions. Suppose that 
we wish to test the response to cottonseed when no row fertilizer is 
applied. The response is (79.5 — 72.1), or 7.4 cm. The approximate 
standard error is 5.09 X 1.069 X +/2, or 7.69. To obtain the correct 
value, we write symbolically, ; 


Sib 2) (be — bo)(2a9 — ay — 
G-mee@ orate S 0) ( 7 a — da) 


Tf all means are unadjusted means, this is an algebraic identity. The 
first term on the right is a component of the main effect of B, while the 
second is a component of the AB interaction. Further, by a property of 
least squares, the equation remains valid if means adjusted for block ef- 
fects are substituted for unadjusted means. By the rule for the variance 
of a linear function, the variance of the first term on the right is 67/9, 
where o” is the variance of a single figure. By the same rule, the variance 


of the second term is 
12\ /8 
log 
9/7 \7 


the factor 8/7 being introduced because the second term is a component 
of AB. The two terms are orthogonal, so that the estimated standard 
error of (b2 — bo)ao is 
5.09V§ + G28) = 5.09V2.190 = 7.53 
Adjustment of the 18 individual treatment means is more tedious. In 
table 6.16, (iv), we divide the quantities 2/1’, etc., by 30, and the quan- 
tities R1, etc., by 24, thereafter taking deviations from the mean of each 
trio. This gives 
my = 44.6, ro! = —0.5, 7x = —4.2 oy = $4.2, se’ = —1.2, 8p’ = —3.1 
n=448, m=-22, m=—-27 #=—01, 2 =+26, = —26 
6ry = —0.2, br = 41.7, 6r3 = —1.5 és; = 14.8, Oso = —3.8, 683 = —0.5 


182 CONFOUNDING 6.19 


In this experiment there is little point in adjusting all 18 treatment 
means. However, the unadjusted treatment totals, table 6.16, (i), sug- 
gest that there might be an effect of the meals (factor B) when applied 
alone. Hence it may be of interest to test the difference between the 
adjusted means of agbscy and agboco. 

From table 6.16, (iii) and (iv), we see that agboco belongs to 7, 41, 71, 
and s;. Since the unadjusted mean is 50.0, the adjusted mean is 


50.0 + 6%, + 67, — dry — 63; = 
50.0 + 1.6 + 0.0 + 0.2 — 4.3 = 47.5 cm. 


Note that the terms in ér and 6s carry a minus sign whenever the treat- 
ment is at the co level, and a plus sign when the treatment is at the c, 
level. For agb2¢o, which belongs to ig, js, 72, and 3, the adjusted mean 
will be found to be 73.1 em. 

In order to obtain the standard error of the difference (73.1 — 47.5) 
= 25.6, this must be expressed in terms of components of main effects 
and interactions. This requires some practice, but the method is not 
hard to master. Consider the following table of values of (bz — bo), 
averaged over the 4 replicates. 


(bz — bo) 
a a a2 | Total 


Total! n o P q 


The quantity whose standard error we wish is d, Now it is easy to 
verify that 
6d=q-+(l—m) + (2n—o—p) +[(2d—e—f) — (29-h—h)] 
components of BBO AB ABC 


As before, this is an identity for the unadjusted means and remains valid 


for adjusted means. Hence by the same rule as before 
2 


3602, = — E + 12+ 24 () +24 | = 22.460" 
4 7 5 


where o? is the estimated variance per plot. The extra factor 8/7 is used 
for the AB component and the factor 8/5 for the ABC component, for 
which the relative information is 5/8. Consequently, the estimated 
standard error of d is 


22.46 X 207 


= 11.4 cm. 
36 
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It follows that the t-value for the response to cottonseed when applied 
alone is 25.6/11.4, or 2.25, with 43 d.f., which is significant at the 5% 
level. 

These results illustrate a feature that is common to the mixed series of 
confounded factorials. If the examination of the results is directed 
mostly at simple rather than factorial effects, the analysis is likely to be 
time-consuming because of the adjustments needed to remove block ef- 
fects and to construct standard errors by the method given above. It 
will also be apparent that considerable experience in the analysis of 
variance, or access to expert advice on this topic, is advisable. 


6.2 The Use of Confounded Designs 


6.21 General Recommendations. In order to make efficient use of 
confounding, it is necessary to consider whether the advantage is likely 
to outweigh the disadvantages. 

The advantage comes from the reduction in the experimental error 
by the use of a block which is more homogeneous, or which can be sub- 
jected to a more uniform technique, than the complete replicate. With- 
out some idea of the amount of this reduction, a realistic decision cannot 
be made on the question of confounding. If uniformity data have been 
collected, the experimenter may compare the variability within the in- 
complete blocks with the variability within replicates. In addition, 
from the results of an experiment in which confounding has been em- 
ployed, it is usually possible to estimate what the experimental error 
would have been if confounding had not been used. The method of cal- 
culation, which was first outlined by Yates (6.5, p. 215), is given in sec- 
tion 6.35. In many types of research the reduction in error from con- 
founding varies greatly from experiment to experiment. In such cases 
little reliance can be placed on the results of a single trial; a summary of 
a moderately large group of experiments is necessary. 

The disadvantages of confounding consist of (1) the reduction in 
replication on the confounded treatment comparisons, and (2) in some 
cases, a greater complexity in the calculations. With regard to the first, 
the experimenter should take into account the importance of the inter- 
actions that must be confounded and the extent to which they are con- 
founded. No interaction should be completely confounded unless there 
is good reason to believe, either from previous experience or from the 
nature of the factors to be tested, that the interaction will be negligible. 

As an illustration of the type of reasoning that might be employed in 
reaching a decision, consider the 3 X 3 X 2 design discussed in the pre- 
vious section. The incomplete blocks contain 6 units, with 7/8 relative 
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information on AB and 5/8 relative information on ABC. ABC is ex- 
pected to be negligible; AB, however, is of sufficient interest so that con- 
founding is undesirable if it involves any loss of precision for this com- 
parison. The issue resolves itself into the question: is the error variance 
per unit with blocks of 6 units likely to be less than 7/8 of the error vari- 
ance with blocks of 18 units? If so, the AB interaction is estimated more 
precisely from the confounded design than from randomized blocks, be- 
cause the reduction in variance per unit more than compensates for the 
loss of replication due to confounding. Since the relative information is 
useful for this type of argument, it is given in table 6.19 for all the plans 
presented at the end of this chapter. 

The increase in the amount of computation varies both with the type 
of design and with the type of analysis needed to summarize the results. 
Where all the confounding is complete, the statistical analysis is prac- 
tically the same as with randomized blocks. With all factors at two or 
three levels (i.e., in the 2” and 3” series), the extra computations are not 
formidable even when partial confounding is used; they are much more 
troublesome when the number of levels is not the same for all factors, 
as in the 3 X 2? and 3 X 2° designs. The extra work consists in the cal- 
culation of adjustments to the partially confounded comparisons so that 
their sums of squares in the analysis of variance are freed from block 
effects. If it is desired to present tables of mean yields which involve 
confounded comparisons, the entries in the tables must be adjusted so 
a8 to remove block effects. This point should be remembered, especially 
with designs where two-factor interactions are confounded. If t-tests of 
simple effects are needed, special computations must be made to obtain 
the standard errors for these tests. Moreover, if part of the data is miss- 
ing, the analysis tends to become much more complicated than with 
randomized blocks. 

The experimenter who has not previously used confounding and who 
does not have access to expert statistical advice is advised to confine him- 
self at first to the simplest types of confounding. He should make sure 
that he understands the method of analysis before the experiment ts con- 
ducted, After the experiment is completed, he should investigate the ip- 
crease in precision that has been achieved by confounding, as a guide to 
the future use of the technique, 

Yates (6.5) has pointed out that in certain circumstances a confounded 
design might give rise to a misleading interpretation of the results. This 
can pe illustrated by means of the 2° factorial in table 6.1 (p. 154). This 
has three replicates, with A BC completely confounded in each replicate. 
Suppose that the response to A is much larger than any other effects, 
and that this response varies greatly from block to block. That is, the 
interaction of A with blocks is much larger than the interactions of the 
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other factors with blocks. Now the factorial effect total for BC may be 
written 


BC = (abe) + (be) + (a) + (1) — (ab) — (ac) — (0) — (@) 
= [(abe) + (a) — (6) — (©)] — [(@b) + (ae) — (be) — (1)] 
= response to A in blocks 1, 3, 5 — response to A in blocks 2, 4, 6 


The point is that the factorial effect of BC is a component of the inter- 
action of A with blocks. When tested against the error m.s. (12 df.) 
this response might be significant, not because there is an effect of BC, 
but because A has large interactions with blocks. 

Another way of describing the source of the trouble is to say that the 
error m.s. is not homogeneous, in that the interaction of A with blocks 
is larger than the interactions of other treatments. If this were sus- 
pected, the danger of a misleading interpretation could be avoided by 
subdividing the error m.s. We can isolate 4 d.f. (two from blocks 1, 3, 
and 5 and two from blocks 2, 4, and 6) that represent the interaction of 
A with blocks. The mean square for these four could be used to test A 
and BC, while the remaining factorial effects would be tested by the other 
8d.f.forerror. This device can usually be applied in the 2” series, though 
generally not in the mixed series. Further, the heterogeneity of the 
error variance may not be suspected. 

How frequently this danger arises will depend on the nature of the 
experimentation. Investigations by Yates (6.5) and Kempthorne (6.9) 
suggest that there is little danger with fertilizer experiments on English 
farm crops. In lines of research where errors have often been found to be 
heterogeneous, one should be on the alert for this effect. 

In some experiments the use of confounding is practically unavoidable, 
For instance, an experiment on methods of manufacturing ice cream, 
planned at North Carolina State College, contained 3 factors each at 3 
levels. It was desired to discover whether ordinary consumers could 
detect differences in the palatability of the 27 different types of ice cream. 
From previous experience it was believed that a taster (who was not an 
expert judge) could compare at most 3 ice creams at any single test. 
Moreover, it did not seem feasible to ask the tasters to grade the pal- 
atabilities on any kind of standard scale: they could merely be asked to 
rank the 3 ice creams in order of preference, if they had any preferences. 
Consequently, a design for a 33 factorial in blocks of 3 units was re- 
quired, even though this involved confounding of two-factor interactions. 
The design adopted also employed partial confounding of the main ef- 
fects, so as to obtain approximately equal accuracy on main effects and 
two-factor interactions. 
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TABLE 6.19 INDEX TO PLANS OF FACTORIAL EXPERIMENTS CONFOUNDED IN 
RANDOMIZED INCOMPLETE BLOCKS 
a. Designs with which any number of replicates may be used 


Num- Num- 


ber of _ ber of 

treat- units per 
Type ments block Interactions confounded in a single replicate * Plan 
23 8 4 ABC 6.1 
2f 16 8 ABCD 6.2 
24 16 4 AB, ACD, BCD 6.4 
2 32 8 ABC, ADE, BCDE 6.5 
28 64 16 ABCD, ABEF, CDEF 6.3 
28 64 8 ABC, CDE, ADF, BEF, ABDE, BCDF, ACEF 6.6 
(34 27 9 ABC (34) 6.7 
3s 81 9 ABC, ABD, ACD, BCD, all (34) 6.8 
a 16 4 AB (24) 6.12 
4x2? 16 8 ABC (28) 6.13 


*The fractions in parentheses give the relative information on the comparisons 
which are confounded. Where no fraction is given, the comparison is completely 
confounded. 

b. Balanced designs 


Number of 
replicates 
Number Units for a Interactions confounded and 
of treat- per balanced relative information (in paren- 
Type ments block design theses) * Plan 
Das 16 4 6n + All two-factor (5); all three-fac- 
tor (14) 6.4 
2 32 8 bn All three-factor (44); all four-fac- 
tor (44) 6.5 
98 64 8 10n All three-factor (44); all four-fac- 
tor (44) 6.6 
3 27 9 4n All three-factor (34) 6.7 
3f 81 9 4n All three-factor (34) 6.8 
8)x:2? 12 6 3n ft BC (86), ABC (56) 6.9 
3x2 24 6 38nt BC, BD, CD, all (84); ABC, ABD, 
ACD, all (56) 6.10 
3x2 18 6 4n AB (¥%), ABC (54) 6.11 
a 16 4 3n AB (%) 6.12 
4x2 16 8 3n ABC (2) 6.13 
4X3X2 24 12 On t AC (2847), ABC (2347) 6.14 


* The factors ABC --- are read from the left; thus the BC interaction in a 3 X 2° 
design is the interaction between the 2 factors at 2 levels. 

+ The symbol ‘‘6n”’ denotes that the number of replicates should be a multiple of 6. 

{In these cases only the balanced design is recommended. 
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6.22 Index to Experimental Plans. The opportunities for confound- 
ing vary with the number of factors and with the number of levels of each 
factor, ‘The plans given at the end of this chapter are those that are 
likely to be most frequently used. 

Table 6.19, which forms an index to the plans, is divided into two parts. 
The first part shows a group of designs in which any number of replicates 
may be used. In successive replicates, the experimenter may either (i) 
repeat the first replicate (with a new randomization), in which case the 
same comparisons are confounded in all replicates, or (ii) change the 
grouping into blocks in successive replicates, so as to spread the con- 
founding over a greater number of comparisons. In the plans, the fac- 
torial effects that are confounded are shown separately for each replica- 
tion. Consequently, the reader can readily see what possibilities are 
open in this direction. Where confounding can be restricted to a high- 
order interaction, as in the 2* design in blocks of 8 units, the first pro- 
cedure is preferable. On the other hand, with the 2* design in blocks of 
4 units, where one two-factor interaction must be confounded in every 
replicate, the confounding should be changed in successive replications, 
unlegs it is certain that one of the two-factor interactions will be neg- 
ligible. More detailed recommendations for the individual designs are 
given in section 6.3. 

The second part, table 6.196, shows a group of balanced designs, in 
which all interactions of the same order and the same type are con- 
founded to the same extent. Thus in the 3 X 2 X 2 X 2 (or 3 X 2°) 
design, all first-order interactions between the factors which have 2 levels 
are equally confounded, the relative information being 8/9. The first- 
order interactions between the factor at 3 levels and any factor at 2 levels 
are unconfounded. Balanced designs are useful in cases where all inter- 
actions of the same order and type are of equal interest, so that it is un- 
desirable to confound some more fully than others. Also, the computa- 
tions are usually simpler. For this reason, only the balanced designs 
should be used in certain cases, as indicated in table 6.19. On the other 
hand, with balanced designs the available numbers of replications are 
severely limited. 

Table 6.20 shows a number of designs for which plans are given in the 
references cited. These designs are mainly for mixed series, and will not 


be discussed here. 


6.23 Randomization. The procedure is the same for all designs. 
1. Arrange the positions of the blocks at random within each replica- 


tion, using a new randomization for each replication. 
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2. Allot the treatments in any block at random to the units in the 
block, using a new randomization for each block. 


TABLE 6.20 ConrouNDED DESIGNS FOR OTHER FACTORIAL EXPERIMENTS 


Units 
per Number of 
Type block replicates Interactions confounded Reference 
3? xX 2? 12 2n AB (%), ABCD (5) 6.6 
3x2 18 2n ABC, ABCD 6.6 
3x2 6 2n AB, AC, BC (14), ABD, ACD, BCD, ABC 6.1 
4X3 12 2n BC (%), ABC 6.6 
ex2 16 Any ABC 6.6 
“2X3 12 3n AB (2987), ABC 6.6 
43 16 Any ABC 6.7 
4x28 8 3n ABC, ABD, ACD 6.6 
at 16 Any ABC, ABD, ACD, BCD 6.7 
5? 5 4n AB (34) 6.8 
5x2? 10 5n BC (2465), ABC 6.6 
5 25 Any ABC 6.8 


6.24 Experiments in Single Replication. In most of the plans at the 
end of this chapter, the total number of treatment combinations is large. 
This fact limits the number of replications that can be employed. Some- 
times the resources are sufficient for only a single replication unless the 
experimenter omits some factors which he originally intended to include. 

With only one replication, we cannot derive an estimate of error from 
the interactions of treatments with blocks. If, however, certain high- 
order interactions are negligible, their mean squares in the analysis of 
variance will behave exactly like components of the error m.s., and there- 
fore can be used to provide an estimate of error. In the 2” series, the 
smallest design for which a single replicate is likely to be used is the 2° in 
blocks of 8 units (plan 6.5). In replicate 1 of this plan, ABC, ADE, and 
BCDE are completely confounded. If the remaining three-, four-, and 
five-factor interactions furnish the estimate of error, the analysis of vari- 
ance appears as follows. 


df. 
Blocks 3 
Main effects 5 
Two-factor interactions 10 


Enror (from high-order interactions) 13 


Total 31 
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Other designs in which a sufficient number of error d.f. are available 
from interactions among three or more factors are the 2°, the 3°, and the 
34 The interactions that constitute the estimate of error should be 
chosen before the results have been inspected. Recommendations for 
each of the designs above are given in the notes on the plans (section 
6.3). It may sometimes be wise to deviate from these recommendations. 
For instance, in the 2° example above, the main effects of A, C, and D 
and the AC and AD interactions might all turn out to be rather large. 
This would suggest that the ACD interaction may not be negligible, and 
that it should not be included in the interactions used as error. It will 
be realized that a decision of this type should ‘be made before examining 
ACD itself. The practice of examining the high-order interactions and 
using those that are small as error leads to a serious underestimation of 
the true error variance. 

The estimation of error from high-order interactions is, of course, open 
to criticism. If some of these interactions happen to be large, the error 
m.s. that is used will overestimate the true error variance and the fact 
that the interactions are large will not be discovered. Where a consid- 
erable series of experiments of the same general type is being conducted, 
some safeguard is obtained both by examining the high-order interactions 
in experiments that are replicated and by watching the two-factor inter- 
actions in experiments with single replication. If most two-factor inter- 
actions are small, it seems very unlikely (though still possible) that inter- 
actions of higher order will be large. On the other hand, if many two- 
factor interactions are found to be large, this suggests that some three- 
factor interactions may also be substantial. For further discussion, see 
Yates (6.5) and Cornish (6.10). 


6.25 Experiments with Fractional Replication. More recently several 
writers, Finney (6.11) and (6.12), Plackett, and Burman (6.18), and 
Kempthorne (6.14) have considered the use of only part of a replication 
in a factorial experiment. Not enough experience has accumulated to 
permit appraisal of the utility of these designs, and only an introduction 
to them will be given. They are most likely to prove useful where (i) a 
large number of factors is tested, so that even a single replication is not 
feasible, and (ii) all high-order interactions are known with confidence to 
be negligible. 

Tn order to see what happens when the experiment contains only part 
of a replication, consider a 23 factorial in which only the 4 treatment 
combinations a, b, c, and abe are tested. This is half of a complete rep- 
licate, It is clear that the three-factor interaction ABC cannot be esti- 
mated at all, since this represents the difference between the 4 treatments 
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that are tested and 4 that have not been tested. Finney (6.12) has called 
this comparison the defining contrast, since it defines the two halves into 
which the replicate is divided. Further, the main effect of A, as esti- 
mated from the difference between units with a and units without a; i.e., 


(abe) + (a) — (6) — ©) 


is exactly the same comparison as the BC interaction, as estimated from 
the difference between units that receive an even number of b’s or c’s and 
those that receive an odd number. Similarly, the estimate of B is the 
same as that of AC, while C is the same as AB. Factorial effects that are 
represented by the same treatment comparison are called aliases; thus 
BC is an alias of A. It has been shown (6.12) that the alias of any effect 
is its generalized interaction with the defining contrast. For example, 
the interaction of A with ABC is A?BC or BC. This result makes it 
easy to discover the alias of any factorial effect. 

If the experiment shows an apparent effect of ‘A, there is no way of 
telling from the results whether the effect was due to A, or to the BC 
interaction, or to a mixture of the two. This type of information is too 
vague to be satisfactory. If, however, it can be taken for granted that: 
all two-factor interactions are negligible, the ambiguity disappears, since 
we would conclude that an apparent effect of A was in fact due to A. 
With these assumptions, the half-replicate enables us to estimate the 
three main effects. 

This example brings out the fact that in order to obtain definite in- 
formation, it is necessary to assume that certain interactions are non- 
existent. The designs discussed by Finney and Kempthorne are adapted 
for situations where all interactions among three or more factors are 
negligible. In those presented by Plackett and Burman, even two-factor 
interactions are ignored. 

We will examine the construction of a half-replicate of the 2° factorial, 
since this is probably the smallest member of the 2” series for which 
fractional replication might be needed in practice. It is desired to obtain 
information on both main effects and two-factor interactions. First 
divide the treatment combinations into two sets of 32 by means of the 
defining contrast ABCDEF. Either set may be chosen as the half- 
replicate. The factorial effects are shown in table 6.21. 

Tt will be noted that, rather fortunately, every alias of a main effect 
or two-factor interaction is a high-order interaction of a type that is as- 
sumed non-existent. The alias of a three-factor interaction is another 
three-factor interaction. Since all three-factor interactions are assumed 
negligible, these components may be used to estimate error. Conse- 
quently, this plan, if set out as a completely randomized design in 32 
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units, enables all main effects and two-factor interactions to be estimated 
and provides 10 d.f. from three-factor interactions for the estimation of 
error. 

Since a completely randomized design is rather inaccurate for many 
types of experiment, an arrangement in blocks of 16 or 8 units may be 
preferable. If the 32 units are divided into two groups of 16, one fac- 


TABLE 6.21 Srrucrurn or A 2° FACTORIAL (HALEF-REPLICATE) 


Main effects Alias Main effects Alias 
A BCDEF D ABCEF 
B ACDEF E ABCDF 
(6) ABDEF F ABCDE 

Two-factor interactions Alias Two-factor interactions Alias 
AB CDEF BF ACDE 
AC BDEF cD ABEF 
AD BCEF CE ABDF 
AE BCDF CF ABDE 
APF BCDE DE ABCF 
BC ADEF DF ABCE 
BD ACEF EF ABCD 
BE ACDF 

Three-factor interactions Three-factor interactions 

(used as error) Alias (used as error) Alias 

ABC DEF ACE BDF 
ABD CEF ACF BDE 
ABE CDF ADE BCF 
ABF CDE ADF BCE 
ACD BEF AEF BCD 


torial effect and its alias are confounded with groups. The best: choice is 
one of the aliases from the set used as error, e.g., ABC and DEF. The 
plan is easily constructed. Suppose that the set of 32 which contains 
abcdef has been chosen as the half-replicate. This is divided into two 
groups of 16 such that one group (block 1) contains all treatment com- 
binations carrying an even number of the letters a, b, orc. The plan and 
analysis are shown in table 6.22. 

For a design in blocks of 8 units, we must confound with blocks an- 
other alias pair from the comparisons that are used as error. Suppose 
that ABD and its alias CEF are selected. It will be recalled (section 
6.13) that the generalized interaction of this pair with ABC and DEF is 
also confounded. By the usual rule, this interaction is the alias pair CD 
and ABEF. It is unfortunate that a two-factor interaction is con- 
founded, but further trial shows that this is unavoidable in blocks of 8 
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TABLE 6.22 A HALF-REPLICATE OF A 2° FACTORIAL IN BLOCKS OF 16 UNITS 


Block 1 Block 2 
() ad 
de ae 
df af 
ef bd 
ab be df. 
ac bf Blocks 1 
be ed Main effects 6 
abde ce Two-factor interactions 15 
abdf of Error (from three-factorial 
abef adef interactions) 9 
acde bdef _ 
acdf cdef Total 31 
acef abed 
bede abce 
bedf abef 
beef abcdef 


Note that the error d.f. are reduced from 10 to 9, since 1 d.f. has been assigned to 
blocks. 


units. To construct the plan (table 6.23), divide each block of table 6.22 
into halves that contain respectively an odd and even number of the 
letters c and d. 


TABLE 6.23 A HALF-REPLICATH OF A 2° FACTORIAL IN BLOCKS OF 8 UNITS 


Block 
4 2 3 4 
(1) de ae ad df. 
ef df af bd Blocks 3 
ab ac be ce Main effects 6 
abef be bf of Two-factor interactions 14 
acde abde cd abce Error 8 
acdf abdf abed abcf - 
bede acef edef adef Total 31 


bedf beef abcdef bdef 


In practice, the letters should be allocated to the factors so that CD 
is the two-factor interaction that is considered least important. In the 
analysis of variance, the sum of squares for blocks, main effects, and the 
unconfounded two-factor interactions are all computed in the usual way. 

Other experiments for which plans are described in reference (6.12) are 
a half-replicate of a 2” factorial in blocks of 8 units, a half-replicate of a 
98 factorial in blocks of 16, a quarter-replicate of a 2° factorial in blocks 
of 16, and a third-replicate of a 3° in blocks of 9. 
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For the case where two-factor interactions are ignored, Plackett and 
Burman (6.13) give methods for constructing a large number of plans in 
which all main effects can be estimated. The only restriction is that the 
number of units tested is a multiple of 4. For instance, there is a plan 
for a 2” factorial, using only 8 units (one-sixteenth of a replicate). All 
seven main effects can be obtained, though no comparisons remain for 
an estimate of error. There are further plans in 12, 16, 20, ete., units, 
giving 4, 8, 12, ete., df. for error. All these plans are set out for a com- 
pletely randomized design: with further confounding a reduced block 
size could be employed. As an illustration the plan with 16 units is given 
below. To facilitate study of this plan, the treatment combinations are 
shown both in our usual notation and by means of + and — signs to 
denote the two levels of each factor. 

Every factor occurs 8 times at each level. For every pair of factors, 
each of the sign combinations, ++, +—, —-+, and —-—, occurs four 
times. ‘Thus the estimates of all seven main effects are mutually orthog- 
onal, and are calculated in the usual way. With complete randomiza- 
tion, 8 d.f. are available for the estimation of error. 


TABLE 6.24 2! Facrorran in 16 UNITS (ONE-EIGHTH REPLICATE) 


Treatment combination 
Our + and — notation 
Unit notation 

a b c d e J g 

1 (1) = a = = a fe 3 
2 dg os ad os =e = = te 
3 cfg ts = oP = = op sth 
4 cdf = = ts ar > a rs 
5 bef = op = Teas ts = 
6 bdefg = Sy = a + oH =i 
7 beeg = ety afte a) Ohne ae tee 
8 bede = eH ts hag ay fe om 
9 ae ae = a re ah oe = 
10 adeg a3 = oo? + + =} ay 
au acefg Soe wee eg Swi porn ar 
12 acdef aR ED on ct a a = 
138 abf Ss a — 7 = oe bees 
14 abdfg a a St = a 5 
15 abeg + + = = fe + 
16 abed fe ae + ae = Ce = 
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Since it is not clear how frequently all factors can be assumed to be 
additive in their effects, the utility of these designs remains to be seen. 
As Yates (6.5) has pointed out, one case where there is additivity occurs 
when each factor represents an object to be weighed. Suppose that 
weighings are done on a chemical balance. Testing the + level of a 
factor corresponds to placing the object in the right-hand pan; testing 
the — level corresponds to placing the object in the left-hand pan. 

Thus the rows of table 6.24 provide a set of instructions for perform- 
ing 16 successive weighings on 7 objects. In the first weighing, all ob- 
jects are placed in the left-hand pan; consequently the weight that is 
read is the total weight of all 7 objects. In the second weighing objects 
dand g are placed in the right-hand pan and the others in the left-hand 
pan, and so on. This series of 16 weighings enables us to estimate the 
weight of each object as precisely as if all weighings had been devoted to 
that object alone. To make the analogy complete, a balance with a zero 
error would be needed, the zero error corresponding to the mean yield of 
an experiment, which can also be estimated from the 16 results. For a 
discussion and references, see Mood (6.15). 


6.3 Notes on the Plans and Statistical Analysis 


6.31 2” Series. In plans 6.1 to 6.6, one level of a factor is denoted 
by the corresponding letter; the other level by the absence of the letter. 
With designs that are not balanced, care must be taken to allocate the 
letters a, b, ¢, .. . to the factors in the experiment so that the interactions 
which are confounded are those that the experimenter has decided to 
confound. For instance, the first replication of plan 6.4 (2* design in 
blocks of 4) confounds the interaction AB, but no other two-factor inter- 
action. The factors labelled A and B should be those whose interaction 
is of least interest. 

The statistical analysis follows the same general method for all 2” de- 
signs. The total s.s. and the sum of squares among blocks are obtained 
by the usual methods. The factorial effects may be computed by one of 
the methods outlined in section 6.14. When this calculation has been 
completed, the sums of squares are added for all unconfounded compar- 
jsons. Let this total be Sr. If all the confounding is complete, S7 is the 
treatment s.s., since the effects that are confounded appear already 
in the block s.s. For each comparison which is partially confounded, the 
sum of squares is calculated separately from those replications in which the 
comparison is unconfounded. The replications to be used are clear from 
the plans, which show the factorial effects that are confounded in each 
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replication. The total of these sums of squares js added to Sr to give 
the treatment s.s. The error s.s. is found by subtraction. 

2 factorial, blocks of 4 units. Plan 6.1, which confounds ABC com- 
pletely, may be used for every replication. For a worked example, see 
reference (6.1), p. 19. 

Alternatively, with 4, or a multiple of 4, replicates, we may confound 
partially all two-factor interactions and the three-factor interaction, the 
relative information on each comparison being 3/4. The plan and sta- 
tistical analysis are described in reference (6.1), p. 21. 

2 factorial, blocks of 8 units. Plan 6.2 should be used for every replica- 
tion, ABCD being completely confounded, 

Note that the treatment s.s. contains only 14 d.f. since the single de- 
gree of freedom for ABCD is included in the blocks. 

2 factorial, blocks of 4 units. Plan 6.4 gives a balanced design with 6 
replications. If a smaller number of replicates is used, the replicates 
chosen depend on the interactions which the experimenter decides to 
confound. Since one two-factor interaction is confounded in each rep- 
licate, it will usually be preferable to confound different comparisons in 
successive replications. For instance, if AB and CD are to be con- 
founded, replicates 1 and 6 should be selected. This arrangement gives 
1/2 relative information on AB, CD and the three-factor interactions 
ACD, BCD, ABC, ABD, all other comparisons being clear of blocks. 

With a 2* experiment, the investigator may be uncertain whether to 
confound in blocks of 8 units or in blocks of 4 units. The former design 
is recommended unless (a) the interactions which are confounded with 
blocks of 4 units are known to be small or (0) the block of 4 units is much 
more homogeneous than the block of 8 units. The latter condition may 
apply where the experimental material is extremely variable, or falls 
naturally into groups of 4 units. 

2° factorial, blocks of 8 units. The balanced design (plan 6.5) requires 
5 replicates. When fewer than 5 replicates are used, it may be satis- 
factory to confound completely two three-factor interactions and one 
four-factor interaction. In this case replication 1 is repeated the re- 
quired number of times. 

If some information is desired on all treatment comparisons, replica- 
tions 1, 2, --- may be used up to the proposed number. With replications 
1 and 2, for instance, the interactions ABC, ADE, ABD, BCE, BCDE, 
and ACDE are partially confounded, each with relative information 1/2. 

The statistical analysis is described in reference (6.1), p. 27, for the 
case in which the experiment contains only a single replication. 

2 factorial, blocks of 16 units. Plan 6.3, which shows a single replica- 
tion with ABCD, ABEF, and CDEF confounded, may be used for all 
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replications. The 3 interactions above are completely confounded, and 
are omitted from the treatment s.s. in the analysis of variance. 

If there is only 1 replication, an estimate of error may be obtained 
from the unconfounded four-, five-, and six-factor interactions. These 
comparisons supply 22 d.f., of which three are confounded with blocks, 
leaving 19 for the estimate of error. The separation of the 63 d.f. is then 
as follows: 


df. 
Blocks 3 
Main effects 6 
Two-factor interactions 15 
Three-factor interactions 20 
Error (from high-order interactions) 19 


28 factorial, blocks of 8 units. In plan 6.6, the size of block is reduced 
from 64 to 8 units without confounding any two-factor interactions. 
Four of the 20 three-factor interactions and 3 of the 15 four-factor inter- 
actions are confounded in each replication. The balanced design requires 
10 replications, i.e., 640 experimental units, and necessitates a large ex- 
periment. 

With a.single replication, the 63 d.f. in the analysis of variance sub- 
divide as follows: 


df. 
Blocks 7 
Main effects 6 
Two-factor interactions 15 
Three-factor interactions 16 
Error (from high-order interactions) 19 


The simplest procedure when there is more than one replication is to 
confound completely a single set of 7 factorial effects by the repeated use 
of replication 1. For partial confounding, the successive replicates in 
plan 6.6 may be taken. In the first 4 replicates no three-factor inter- 
action is confounded more than once. 


6.32 3” Series. From plan 6.7 onwards, the levels or variations of 
each factor are denoted by the figures 0, 1, 2, ---, while the factors are 
read from left to right. The treatment 102 implies the middle level of 
the first factor, the lowest level of the second factor, and the highest 
level of the third factor. In an experiment the numbers 0, 1, 2, --- may 
be assigned to the levels or variations of a factor according to any con- 
venient scheme. 

The testing of three instead of two variations of each factor makes 
possible a more thorough evaluation of the effects of the factors. In 
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particular, the 3° factorial experiment has been much used in investiga- 
tions of the responses to fertilizers, since the effects of each of 3 different 
plant nutrients may be explored at 3 levels of application, which provide 
some information on the shape of the response-curve. 

3° factorial, blocks of 9 units. In plan 6.7, two of the 8 df. for ABC are 
completely confounded in each replication, so that the balanced design 
requires 4 replicates and provides 3 /4 relative information on ABC. 
Unless the three-factor interactions are known to be negligible, in which 
case replication 1 may be used for all replicates of the experiment, it is 
preferable to select the required number of replications in succession 
from the plan. For example, an experiment with 3 replicates gives 2/3 
relative information on 6 of the 8 interaction df., and full replication on 
the remaining 2 d.f. As usual, the rule in the analysis of variance is to 
estimate confounded comparisons from those replications in which they 
are not confounded. An example of the analysis with 4 replicates has 
already been given (section 6.17). 

For an example of the analysis with a single replicate, see reference 
(6.1), p. 53. If the three-factor interactions are used as error, the separa- 
tion of the degrees of freedom is as shown on the left in table 6.25. 


TABLE 6.25 SUBDIVISION OF THE DEGREES OF FREEDOM IN A 3° pxPERIMENT 
WITH SINGLE REPLICATION 


I. Three-factor interactions Il. For experiments where all 

used as error effects are approximately 
linear 

af. df. 

Blocks 2 Blocks 2 

Main effects 6 Main effects 6 

Two-factor interactions 12 AB: linear X linear 1 

Error 6 AG: linear X linear vt 

— BC: linear X linear 1 

Total 26 Error 15 

Total 26 


The number of error d.f., 6, is rather small. There is an alternative 
scheme for experiments where the effects of the factors are linear, or 
nearly so. Under these circumstances, the only components of the two- 
factor interactions that are likely to be substantial are the interactions of 
the linear responses to the factors. Hence, nine of the 12 d.f. for two- 
factor interactions are added to the material used for an estimate of error, 
as shown in table 6.25, Il. If in addition the linear components of 
the two-factor interaction are large, a further variant is to remove from 
the error the ABC: linear X linear X linear term. This leaves 14 df. 
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for error. The calculation of these components should be fairly simple 
with the help of the example worked in section 6.17. 

3* factorial, blocks of 9 units. With 4 factors each at 3 levels (plan 6.8) 
the block size is reduced from 81 to 9 units by confounding three-factor 
interactions. There are 4 three-factor interactions: ABC, ABD, ACD, 
and BCD, each with 8 df. Two components from each set of 8 are con- 
founded in any replication. If the experiment has more than 1 replicate, 
partial confounding is generally preferable, the required number of rep- 
licates being selected from the balanced set. The balanced design, with 
4 replicates, gives threefold replication on all three-factor interactions. 

Statistical analysis with a single replication. An analysis may be cal- 
culated without difficulty from the following partition of the degrees of 
freedom. 


df. 
Blocks 8 
Main effects 8 
‘Two-factor interactions 24 


Error (from three- and four-factor interactions) 40 


All sums of squares are calculated in the usual way. For the main effects 
and two-factor interactions, six 3 X 3 tables are required. The error, 
which is found by subtraction, contains 24 d.f. representing uncon- 
founded three-factor interactions and the 16 d.f. for the four-factor inter- 
actions. 

This analysis gives no test of the three-factor interactions. If a test 
of ABC, for example, is wanted, the first step is to calculate in the usual 
way the sum of squares for the 8 d.f. From this we must subtract the 
sum of squares for the 2 d.f. that are confounded with blocks. 

In plan 6.8, replication I, the 2 d.f. are denoted by ABC IV, where the 
numeral IV refers to replication IV of plan 6.7. If we ignore D, it will 
be found that blocks 1, 6, and 8 contain the same set of 9 treatments as 
block IVa in plan 6.7. Calculate the total yield of these 3 blocks (27 
units). Similarly, we calculate the total yield of blocks 2, 4, and 9, which 
correspond to block IVb in plan 6.7, and of blocks 3, 5, and 7, which cor- 
respond to block IVe. The sum of squares of deviations of the 3 totals, 
divided by 27, is subtracted from the sum of squares for the 8 d.f. The 
resulting 6 d.f. are removed from the error, which now has 34 d.f. 

The reader may verify that the 2 confounded degrees of freedom for 
BCD (i.e., BCD I1D) are obtained from the totals of blocks 1, 2, 3, blocks 
4, 5, 6, and blocks 7, 8,9. In this case A is ignored when plan 6.8 is com- 
pared with plan 6.7. 

Statistical analysis with more than one replication. If all three-factor 
interactions are presumed to be negligible, they may be combined with 
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the error. The sums of squares for blocks, main effects, and two-factor 
interactions are found in the usual way while the remainder constitutes 
the error. 

All 32 components of the three-factor interactions may, however, be 
calculated and tested. Partially confounded comparisons are taken only 
from replicates in which they are unconfounded. For example, if there 
are two replications of plan 6.8, all the data are used in the calculation of 
components ABC I and ABC II, which are unconfounded. For ABC IIT 
we use only the data from the first replicate of the experiment, and for 
ABC IV only the data from the second replicate. 


6.33 Mixed Series. 3 X 2? factorial, blocks of 6 units. Only the bal- 
anced plan (6.9) should be used; this may be repeated if extra replica- 
tions are wanted. Thus the number of replicates must be 3, or some 
multiple of 3. Although the BC interaction is partially confounded, the 
relative information is high (8/9). 

The statistical analysis is described, with a numerical example, in 
reference (6.1), p. 58. 

3? x 2 factorial, blocks of 6 wnits. In plan 6.11, AB and ABC are 
partially confounded. Hither 2 or 4 replicates may be used, though 4 
are required for complete balance. A numerical example for 4 replicates 
was given in section 6.19. 

With 2 replicates (say I and IT in plan 6.11) the J and R components 
are partially confounded, while the J and S components are uncon- 
founded. The J and RF totals are adjusted by the same formulae as is 
table 6.16, (iii) and (iv). The J and S totals need no adjustment. The 
interaction s.s. in the analysis of variance are computed as follows. 


AB(D = sigl(2hr’)? + (2l2')? + 203')7] — rogl2i’ + 2le! + 213/? 
AB) = poli? + Jo? + Js") — gels + Jo + Jal? 
ABO(R) = Pyl(2Ry’)? + (2Re’)? + (2Rs')?] 
— wgl2Ri’ + 2Ro! + 2R3']° 
ABC(S) = dslSi? + So? + S37] — gig [Si + S2 + Ss]? 


The AB two-way table and the table of individual treatment means 
are adjusted in the same way as with four replicates. The quantities QI, 
QR’ are divided by 18 and 6, respectively, to obtain 7’ and 7’. For 7 and 
r, we divide 7 and R each by 12. After taking deviations from the mean 
in each case, we calculate 62 and 6r. Thereafter the adjustment proceeds 
exactly as with 4 replicates, being somewhat simpler since J and S com- 


ponents do not enter. 
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3 X 2° factorial, blocks of 6 units. In plan 6.10 the interactions BC, 
BD, and CD, between pairs of factors that occur at 2 levels, are partially 
confounded with 8/9 relative information, while 5/9 relative information 
isretained on the interactions ABC, ABD, and ACD. Only the balanced 
design which requires 3 replicates js recommended. 

The following computations lead to the sum of squares due to partially 
confounded effects. Let 


g=albth—-l—-l gn’ =i-W+h — la; 
qa” =1,-h-k +e 


where Iy is the total of block I,, ete. Similar definitions hold for g2,- +, 
gs!’ in the other replications. Then find 


3Q = 3[CD] + gi + 92 + 933 3Q! = 3[BD] + gr’ + go! + gs’ 
3Q” = SBC] + 91" + 92" +90” 
3Ro = 3[CDa] — 91 + go + 933 8R, = 3[(CD,,] + 91 + 92 — 935 
3R2 = 8[CDa] + gi — 92 + 98 
3Ro! = 3IBDa) + 91’ — go’ + gs’; 3Ra! = BBD] — gi’ + 92! + 98" 
3R2! = 3[BDa,] + 91’ + 92’ — gs" 
3Ro!= 3(BCal +91" +92" —93";  8Ry'’ =3[BCa,] +91” — 92" +93"5 
BR" = 3[BCn] — 91” + ga” + 95” 
The sum of squares for the two-factor interactions are as follows. 
3, 2 ry 2 ry 2 
cp = 2"; np = 2”, po = SZ 


For AGD, divide the sum of squares of deviations of the quantities 3Ro, 
3R1, 3R2 by 120, with similar rules for ABD and ABC. 

The CD, BD, and BC two-way tables must be adjusted for block ef- 
fects. For the means per unit in the BC table, the adjustment is 


3Q”. [BC] 
192 72 


The adjustment is added (algebraically) when b and ¢ are both present 
or both absent; when b or ¢ alone is present the adjustment is subtracted. 

42 factorial, blocks of 4 units. In plan 6.12 each replication confounds 
completely a set of three of the 9 df. for AB. Since the relative informa- 
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tion on AB is 2/3, this plan gives less precise estimates of the interactions 
than randomized blocks unless the reduction in block size brings a sub- 
stantial decrease in the error m.s. The balanced design, which requires a 
multiple of 3 replicates, is preferable. 

With 3 replicates the analysis of variance is as follows. 


af. 

Replicates 
Blocks within replicates 9 
Main effects 6 
Two-factor interactions 9 
Error 21 
Total 47 


To compute the sum of squares for AB, let Bmn be the total of the nth 
block in the mth replicate, and let T'mn be the total of all treatments that: 
appear in this block (this will be a total over 12 units). Let Pmn = 
Tn — Bmn, 80 that there are 12 quantities Pn. Then 
AB = 4(Pur? + Pia? +e + P32) — gy (@ + Re? + Rn? + Ru”) 

where @ is the grand total and the R’s are replication totals. 

The estimated block mean (adjusted for treatment effects) is 

3Bnn — Tmn 

8 


To present a two-way table, the adjusted mean of treatment (00) is, for 
example, 


bmn 


(00)’ = (00) — (bry + brta + birt) 


since this treatment occurs in blocks Ib, Ta, and IIIb. 

4X 2 factorial, blocks of 8 units. In plan 6.13, one of the three com- 
ponents of ABC is confounded in each replicate. In other words, the dif- 
ference between the treatments in block la and those in block Ib is one 
component of ABC. The plan gives 2/3 relative information on ABC. 
Any number of replicates may be used, though three are needed for 
balanced confounding. 

To obtain the sum of squares for ABC, calculate the total for each 
component separately, taking it only from those replicates in which it is 
not confounded. The divisor for the square of this total is obtained by 
the usual rules. Thus with 2 replicates the first component is taken from 
replicate II and has divisor 16, the second component is taken from rep- 
licate I and has divisor 16, while the third component comes from both 


replicates and has divisor 32. 
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If there are r replicates, the adjusted block mean is 

TBmn — Tmn 
8(r — 1) 


where Byn is the block total and 7,» that of all treatments appearing in 
the block. With 2 replicates, the adjusted mean of treatment 111 is 


bra + bre 


mn 


111)’ = (11) — 


since this treatment is found in blocks Ia and Ib. 
4X3 X 2 factorial, blocks of 12 units. The 3 d.f. for the factor A at 
4 levels may be divided into the following components. 


A’ = ag + dz — a — A 
A” = a3 — dz — a + 4% 
Al” = ag — 2 + a — a 


In the first 3 replicates of plan 6.14, the interactions A’C and A’BC 
are partially confounded, with 8/9 and 5/9 relative information, re- 
spectively, the interactions of A” and A’” being unconfounded; A’’C and 
A’ BC are partially confounded in replicates 4 to 6 and A’”’C and A'BC 
jn replicates 7 to 9. Although nine replicates are required for complete 
balance, the design may be used with three or six replicates. In these 
cases the last three or the last six replications should be taken from plan 
6.14. The AC interaction suffers only a trivial loss of replication from 
the confounding. The statistical analysis is described in reference (6.6). 


6.34 Missing Values. When factorial experiments are arranged in 
randomized blocks or latin squares with no confounding, the formulae 
previously given (sections 4.25, 4.36) are used for the estimation of miss- 
ing values. With confounding, the formulae are changed, becoming in 
general more complicated. The different cases are discussed below. 


i. Complete confounding of the 2°, 2*, 2°, 2°, 3°, 3* factorials. When 
the effects that are confounded are unimportant, we recommended that 
the first replication of the plan be used for all replications of the experi- 
ment, with complete confounding of these effects. In this case the esti- 
mation formula is simple. Let B be the total yield of all other units in 
the same block as the missing unit, and T be the total yield of all other 
units which have the same treatment as the missing unit. Every replica- 
tion contains a block which has the same set of treatments as the block 
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with the missing value. Let B’ be the total yield of all such blocks (in- 
cluding the block with the missing value). 
The value for the missing unit is estimated by the formula: 


bate kT — B’ 

(r — 1I)(k — 1) 
where k is the number of units per block, and r is the number of replica- 
tions. 

Tt will be realized that this is the standard formula for randomized 
blocks (section 4.25), applied only to those blocks which contain the 
treatment with the missing value. 


ii, Partial confounding of the 23, 24, 2°, 2°, 3°, 34, 7, 4X 2 factorials. 
The formula given later is valid when a different replication of the plan 
is used for each replication of the experiment. A further essential con- 
dition is that no treatment comparison be confounded in more than one 
replication. This condition is satisfied in all plans for the designs above, 
except possibly plan 6.4 (2* in blocks of 4) and plan 6.6 (2° in blocks of 
8). For instance, if the first 2 replications of plan 6.6 are used, no inter- 
action is confounded more than once; but if the first 3 replications are 
used, BCDE is confounded twice and the formula does not apply. 

The formula requires some preliminary calculations, most of which 
can be used in the subsequent analysis of variance. In addition to B and 
T, as defined in case i, we require first the total F of all other units in the 
same replicate as the missing unit and the grand total @ of all other units 
in the experiment. In the other replicates, calculate the total S, of all 
blocks which contain the treatment with the missing unit. 

Next form the table of treatment totals, From this table compute 
the sum U of the observations for all other treatments which appear in 
the block which has the missing unit. In the other replications, calculate 
the corresponding sum for each block which contains the treatment with 
the missing unit. Let the total of these (r — 1) sums be V, where r is 
the number of replicates. Note that in the calculation of U and V the 
treatment with the missing value is not included. 

The missing yield is estimated by the equation 


Mer — 1)T +o —DB+ kG + tV — krR — tr — 1)U — trSy, 
: @— Dee — DE- 1) = =H) 
The table on p. 204 which shows the number of units whose observa- 
tions are added to obtain the various totals 7, B, «:+, may be helpful in 


checking that the totals have been correctly identified. 
Consider U, for example. There are (k — 1) other treatments in the 


y 
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block with the missing value, and each treatment total is the sum of r 
observations. Consequently U is a total of r(é — 1) individual observa- 
tions. 


Total r B G Vv R U Ss 


Number of 
units (r —1)|(& —1)| & — 1) |r@ — IK —1)| ¢-— DY | re — 1] ker - 1) 


iii. Partial confounding of the 8 X 27, 3 X 2, #X24X8X 2 fac- 
torials. For these designs the general method given by Yates (6.16) 
should be followed. 

iv. Single replication of the 2°, 2°, 8°, 8* factorials. A missing value is 
estimated by minimizing the sum of squares for the interactions that are 
used as error. 

The estimation of missing values in cases iii and iv requires a knowl- 
edge of the algebra of the analysis of variance. The expressions for the 
missing value are usually fairly simple in case iv but are complicated in 
case iii. 

When the estimated value has been substituted, the analysis of vari- 
ance is calculated by the appropriate method for the design. As usual, 
the error d.f. are reduced by one for each estimated value. Variance- 
ratio tests of the treatment effects are slightly disturbed. For ¢-tests, an 
approximate rule is to decrease the number of replicates ascribed to any 
mean by 114 for each estimated value that the mean contains. Suppose, 
for instance, that (abc) is missing in a 2° experiment with 3 replications. 
The mean of the units which receive a is taken over 12 observations, one 
of which is an estimated value. By the rule, the standard error of this 
mean is computed as s/V 104 instead of s/ V12. Consequently the 
standard error of the mean response to A is taken as 


Nags 
eM (te by ae 
103 12 


2s? 


12 


instead of 


6.35 Estimation of the Gain in Precision from Confounding. As men- 
tioned previously, the advisability of confounding depends on the 
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amount by which the experimental error is decreased when the block size 
is reduced. It is therefore worth while, especially where confounding has 
not been previously used, to estimate the gain in precision which has been 
obtained in each experiment. 

The technique is essentially the same for all types of design. From the 
results of the analysis of variance of a confounded experiment, an esti- 
mate EZ, is made of the experimental error which would have been 
present if the experiment had been laid out in randomized complete 
blocks. 


Single replication. Let Ey be the mean square for blocks, and #7, the 
mean square for error. If there is only 1 replication in the experiment, 
the estimate H, is 

nL, + nBe 


E,, = ————_ 6.1 
Ny + Ne 6) 

where n, = number of degrees of freedom for blocks. 
Ne = total number of degrees of freedom minus no. . 


For example, with a 2° experiment in blocks of 8 units (single replication) 
nm = 7; Ne = 638 —7 = 56 


The estimate Z, is directly comparable with the estimated error #,. As 
shown in section 2.31, the comparison may be expressed in terms of the 
relative amounts of replication required to obtain equal accuracy with 
the two types of layout. 

The estimate H, is subject to the assumption that the interactions 
which are confounded with blocks are negligible. Since confounding is 
complete, no method of estimation avoids this assumption. If the con- 
founded interactions are not negligible, the calculation overestimates the 
gain from confounding. 

More than one replication. If the confounding is complete, formula 
6.1 holds for Z,, subject again to the assumption that the confounded 
comparisons are negligible. In this case 

Ey = mean square for blocks within replications. 


E, = error mean square. 
nm, = number of degrees of freedom for blocks within replications. 


ne = total number of degrees of freedom minus number of degrees of 
freedom for replications mznus ny. 
Thus with 2 replicates of a 2° design in blocks of 8 units, the same inter- 
actions being confounded in both replicates, we have 


np = 14, Ne = 127 —1— 14 = 112 
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With partial confounding, the same formula may be used if all con- 
founded comparisons are assumed to be small. Evidence on the validity 
of this assumption is provided by the experimental results, since partially 
confounded comparisons can be tested for significance. 

It is possible to avoid the assumption that confounded comparisons are 
negligible, though some extra computations are necessary. We first 
adjust the mean square among blocks within replicates so as to remove 
treatment effects. The following identity holds for the sums of squares. 


Blocks within replicates (adj.) + treatments (unadj.) = 
blocks within replicates (unadj.) + treatments (adj.) 


The two quantities on the right of the equation appear in the analysis 
of variance of the experimental results. Consequently the adjusted 
blocks s.s. is found from this equation by calculating the unadjusted 
treatments s.s. From the adjusted blocks s.s. we obtain, as shown 
below, a new estimate H,’ for use in formula (6.1). 

_ These procedures may be illustrated from the results of the 3 X 3 X 2 
experiment in 4 replicates (section 6.19), in which AB and ABC are 
partially confounded. From table 6.15 the following summary of the 
analysis of variance is obtained. 


df. 5.8. m.s. 
Replications 3 3,837 1,279 
Blocks within replications 8 2,836 354 
Treatments 17 7,802 459 
Error 43 8,909 207 
Total 71 23 , 384 


If AB and ABC@ are assumed negligible, the estimate H, is found by 

formula (6.1) for the case where there is more than one replication. 
8 X 354 + 60 X 207 
Re ee ee 
: 68 

In order to avoid the assumption, we calculate the unadjusted treat- 
ments s.s., which is found to be 8328. Since the adjusted treatments 
s.s. is 7802, we have 


Blocks s.s. (adj.) = 2836 + 7802 — 8328 = 2310 
corresponding to a mean square H,, of 289. 
An unbiased estimate HZ,’ of the variance between blocks within rep- 


licates is 
4E,,—E. 4X 289 — 207 


3 3 


Ey! = 


16 
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This value is inserted in formula (6.1) to give E,’: 
8 X 316 + 60 X 207 
68 ‘ 

The estimated gain in efficiency over randomized blocks is 8% when 
AB and ABC are assumed negligible and 6% without this assumption. 
The two results agree closely, as is usually the case unless the confounded 
interactions are large. 

The method above applies to all designs with partial confounding. 
For the designs discussed in this chapter, the formula for Hy’ in terms of 
E,, and EH. is 
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e 


rE, — Ee 
y r—1 
where r is the number of replicates. 
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PLANS 
Plan 6.1 2° factorial, blocks of 4 units 
Rep. I, ABC confounded 


abe ab 
a a 
bbe 
e () 


Plan 6.2 2 factorial, blocks of 8 units 
Rep. I, ABCD confounded 


Plan 6.3 2° factorial, blocks of 16 units 
Rep. I, ABCD, ABEF, CDEF confounded 
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Plan 6.4 Balanced group of sets for 2* factorial, blocks of 4 units 


Two-factor interactions are confounded in 1 replication and three-factor 
interactions are confounded in 3 replications. The columns are the blocks. 


Rep. I, AB, ACD, BCD confounded Rep. II, AC, ABD, BCD 


(1) ab a b (1) ac a c 


abe c¢ be ac abe b be ab 
abd d bd ad acd d cd ad 
cd abcd acd bed bd abed abd bed 


Rep. III, AD, ABC, BCD Rep. IV, BC, ABD, ACD 


(1) ad a @ (1) be b c 


abd b bd ab abe a ac ab 
acd ¢ cd ac bed d cd bd 
be abcd abe bed ad abed abd acd 


Rep. V, BD, ABC, ACD Rep. VI, CD, ABC, ABD 


(Dba bd (1)\ cd sen aid 
abd a ad ab acd a ad ac 
bed c cd be bed b bd be 
ac abed abe acd ab abed abe abd 


Plan 6.5 Balanced group of sets for 2° factorial, blocks of 8 units 


Three- and four-factor interactions are confounded in 1 replication. 


Rep. I, ABC, ADE, BCDE confounded Rep. Il, ABD, BCE, ACDE 


(6) b GQ) aboa b 

be “ac abe ad bd ad abd 
abd d bd ad abe ¢ be ac 
acd bed be abed bed acd abed cd 
abe ¢ ce ae abe eé be ae 
ace bce ade abce bde ade abde de 

de  abde abcde bde ce  abce ace bce 
bede acde cd cde acde bede cde abcde 
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Plan 6.5 (Continued) 
Balanced group of sets for 2° factorial, blocks of 8 units 


Rep. III, ACE, BCD, ABDE Rep. IV, ACD, BDE, ABCE 

(1) ac a c QQ) ada d 

ae ce e ace ac cdc acd 

abe b be ab abd b bd ab 

bee abe abce be bed abe abed be 

ad ad ed ad ade ¢é dee 

cde ade acde de cde ace acde ce 

bd = abed abd bed be abde abe  bde 
abde bede bde abcde abce bede bee abcde 


Rep. V, ABE, CDE, ABCD 


(1) ae a € 
ab ibe b abe 
ace oc cea 
bee abe abce be 

. ade d de ad 
bde abd abde bd 
ed acde acd cde 
abed bede bed  abede 


Plan 6.6 Balanced group of sets for 2° factorial, blocks of 8 units 
All three- and four-factor interactions are confounded in 2 replications. 
Rep. I, ABC, CDE, ADF, BEF, ABDE, BCDF, ACEF confounded 


abe a b (be ac c ab 

bd ed abed acd abd d ad bed 
ae abee ce bee e abe be ace 
ede bde ade abde acde bede abcde de 

ef of af abf acf bef abef f 

adf abedf cdf bedf df  abdf bdf aca 
bef cef  abcef acef abef ef aef beef 
abcdef adef bdef def bedef acdef cdef  abdef 


Rep. Il, ABD, DEF, BCF, ACE, ABEF, ACDF, BCDE 


abd b a (1) ad bd d ab 

ed ac be abe bed acd abed c 

be abde de ade e abe ae bde 
ace ede abcde bede abce ce bee acde 
af df abdf vif abf f dbf  adf 
bef abedf cdf acdf cf abef acf — bedf 
def aef bef abef bdef adef abdef ef 
abedef beef acef cef  acdef bedef cdef  abcef 
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Plan 6.6 (Continued) 
Balanced group of sets for 2° factorial, blocks of 8 units 


Rep. III, ABE, BDF, ACD, CEF, ADEF, BCDE, ABCF 


be a ac (Ql) abe ab b €. 

acd bd bed abd cd d ad abed 
abe ce e ace be bce abce ae 

de abede abde bcede ade acde cde bde 
af bef of abef f of acf abf 
bdf acdf adf cdf abdf abedf bedf af 
cef abef abcef bef acef aef ef beef 
abedef def  cdef adef bedef bdef abdef acdef 


Rep. IV, ABF, CDF, ADE, BCE, ABCD, BDEF, ACEF 


ac a b Gy se abe be ab 
bd bed acd abed abd d ad ed 
bee be ae abe abce ce ace € 
ade acde ede cde de abde bde  abede 
(Of a aabepe RL bh DEE ati, acf 
cdf dj abdf-adf acdf bedf abedf baf 
of cef abcef acef aef bef  abef beef 
abedef abdef def  bdef bedef acdef cdef adef 


Rep. V, ACF, BCD, ADE, BEF, ABDF, CDEF, ABCE 


ab a be (Dye ac c abe 
bed ed abd acd abed d ad bd 
ce bee = ae abee ace be abe eé 


ade abde cde de de abede bede  acde 
acf abef ff bef of abf bf af 

df bdf acdf abdf adf  bedf abedf cdf 
bef ef abcef aef abef  cef acef beef 
abcdef acdef bdef  cdef bedef adef def  abdef 


Rep. VI, ABC, BDE, ADF, CEF, ACDE, BCDI, ABEF 
Interchange B and C in replication I 

Rep. VIL, AB, DEF, BCD, ACE, ABDE, ACDF, BCEF 
Interchange F and D in replication II 

Rep. VIII, ABE, BDF, CDE, ACF, ADEF, ABCD, BCEF 
Interchange A and E in replication III 

Rep. IX, ABD, CDF, AEF, BCE, ABCF, BDEF, ACDE 
Interchange F' and D in replication IV 

Rep. X, AEF, BDE, ACD, BCF, ABDF, CDEF, ABCE 
Interchange H and C in replication YW 
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Plan 6.7 


Plan 6.8 


CONFOUNDING 


Balanced group of sets for 3° factorial, blocks of 9 units 
ABC confounded 


Rep. I 


Rep. II 


Balanced group of sets for 34 factorial, blocks of 9 units 


Three-factor interactions confounded 
Rep. I, ABC IV, ABDI, ACD II, BCD III confounded 
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Plan 6.8 (Continued) 
Balanced group of sets for 34 factorial, blocks of 9 units 


Rep. II, ABC III, ACD IV, ABD Il, BCD IT 
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Rep. III, ABC I, ACD III, ABD IV, BCD IV 
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IV, ABC I, ACD I, ABD II, BCD IL 
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Plan 6.9 Balanced group of sets for 3 X 2? factorial, blocks of 6 units 
BC, ABC confounded 


Rep. I Rep. II Rep. III 
a ob a ob a b 
001 000 000 001 000 001 
010 O11 O11 010 O11 010 
100 101 101 100 100 101 
111 110 110 111 te LO! 
200 201 200 201 201 200 


211 210 211 210 210 211 


Plan 6.10 Balanced group of sets for 3 X 2° factorial, blocks of 6 units 


BC, BD, CD 
ABC, ABD, ACD confounded 


Rep. I 
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Plan 6.11 Balanced group of sets for 3° X 2 factorial, blocks of 6 units 
AB, ABC confounded 
Rep. I ‘Rep. I 
a b c a b ¢ 
4 100 200 000 200 000 100 
{ 210 O10 110 010 110 210 
020 120 220 120 220 020 
201 001 101 101 201 001 
O11 111 211 211 O11 111 
j 121 221 021 021 121 221 
Rep. III Rep. IV 
a b c a b c 
100 200 000 200 000 100 
010 110 210 110 210 010 
220 020 120 020 120 220 
201 001 101 101 201 001 
11 211 Ol Ol 111 211 
021 121 221 221 021 121 
Plan 6.12 Balanced group of sets for 4* factorial, blocks of 4 units 
AB confounded 
Rep. I Rep. IL Rep. II 
ab 1c! a > co @ Gib: 1) aa: 
33 32 31 30 33 30 32 31 33 31 32 30 
22 23 20 21 21 22 20 23 20 22 21 23 
10 11 12 18 12 id) 18; 10: 11 18 10 12 
01 00 03 02 00 03 O1 02 02 00 03 O1 
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Plan 6.13 Balanced group of sets for 4 X 2? factorial, blocks of 8 units 


ABC confounded 

Rep. I Rep. II Rep. II 

a b a 6 a b 
000 001 000 001 000 001 
O11 010 O11 010 O11 010 
100 101 101 100 101 100 
lil 110 110 111 110 111 
201 200 201 200 200 201 
210 211 210 211 211 210 
301 300 300 301 301 300 
310 311 311 310 310 311 


Plan 6.14 
Balanced group of sets for 4 X 3 X 2 factorial, blocks of 12 units 


A'C, A’BC confounded 
Rep. I Rep. Il Rep. II 
6 wb ab ae 


000 001 001 000 001 000 
O11 010 010 O11 011 010 
021 020 021 020 020 021 
100 101 101 100 101 100 
1i1 110 110 111 lt 110 
121 120 121 120 120 121 
201 200 200 201 200 201 
210 211 211 210 210 211 
220 221 220 221 221 220 
301 300 300 301 300 301 
310 311 311 310 310) 311 
320 321 320 321 321 320 
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Plan 6.14 (Continued) 
Balanced group of sets for 4 X 3 X 2 factorial, blocks of 12 units 


A’C, A" BC 
Rep. IV Rep. V Rep. VI 
Cie) ay a0 gC pet 
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CHAPTER 7 


FACTORIAL EXPERIMENTS WITH MAIN EFFECTS 
CONFOUNDED: SPLIT-PLOT DESIGNS 


7.1 The Simple Split-plot Design 


7.11 Description. In field experiments an extra factor is sometimes 
introduced into an experiment by dividing each plot into a number of 
parts. For example, if the experiment is planned originally to test a 
factor A with five levels, the division of each plot into halves permits the 
inclusion of an extra factor B at two levels. Within each plot the two 
levels of B are allotted at random to the two sub-plots. If the whole 
plots are in a randomized block design, the plan (after randomization) 
might appear as shown in table 7.1. It is worth noting the difference be- 


TABLE 7.1 EXAMPLE OF A SPLIT-PLOT DESIGN 


Rep. 1 Rep. 2 Rep. 3 
a3 a a a aM aq a4 a ad a3 a ag a a 4% 
bo} |b1} |b0] |b} | bo bi} |b} | bo} | bo} | bo bi] | bo} |bo} [bo] }b1 
bi} | bo} | br) | br] | br bo} | bo} | bi} | br] | br bo} |b1} | bi} | br} | bo 


tween this arrangement and ordinary randomized blocks. In the latter 
the ten treatment combinations are assigned to the ten sub-plots in a 
replication completely at random. Here we have a more orderly assign- 
ment in which the two treatment combinations that have any given level 
of A always appear in the same whole-plot. 

This type of arrangement is common in industrial experimentation, 
although the connection may not at first be realized. Frequently, one 
series of treatments requires rather a large bulk of experimental ma- 
terial, while another series can be compared with much smaller amounts. 
For instance, the comparison of different types of furnace for the prep- 
aration of an alloy would use much greater amounts of alloy than the 
comparison of different types of mould into which the alloy might be 
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poured. In an experiment in which both factors are to be tested, the 
natural procedure is to take the material prepared in any furnace, and 
pour some of it into each mould. That is, material prepared in one 
furnace at one time provides a complete replication for the comparisons 
among moulds, just as the plot containing any level of A provides a com- 
plete replication of the factor B. Another instance is the comparison of 
different machines for milking dairy cows. Each machine would neces- 
sitate rather substantial amounts of milk, whereas other comparisons, 
for example on the best method of pasteurizing or of cooling, could be 
conducted with a much smaller amount of milk per treatment. The 
produce from any machine could be subdivided for these subsequent 
tests. 

We may describe this design in another way that brings out more 
clearly its relation to the confounded factorial designs discussed in chap- 
ter 6. If the sub-units are regarded as the experimental units, it is seen 
that the treatments do, a, . . . , a4 are applied to groups or blocks of two 
units. Differences among these blocks are confounded with differences 
among the levels of A; i.e., the main effects of A are confounded. <Ac- 
cordingly, the split-plot design is sometimes considered as one in which 
certain main effects are confounded, as contrasted with the designs in 
chapter 6, where the confounding is restricted to interactions. 


7.12 Nature of the Experimental Error. In the statistical analysis, 
account must be taken of the fact that the observations from different 
sub-units in the same unit may be correlated. In field experiments this 
correlation is just a reflection of the fact that neighboring pieces of land 
tend to be similar in fertility and in other agronomic properties. In in- 
dustrial experiments the same correlation may be present, because any 
factor that affects the whole batch of alloy prepared in one furnace at 
one time will tend to create similarity among smaller batches poured 
from it. 

The mathematical analysis used to examine the effects of this correla- 
tion will be illustrated for the experiment in table 7.1. Let 7 refer to the 
replication, j to the level of A, and & to that of B: for sub-units in the 
same unit, the ¢ and j subseripts will be the same. The assumption is 
made that a correlation p exists between the experimental errors ¢;;; and 
éiju for any two sub-units in the same unit. Sub-units in different units 
are assumed to be uncorrelated. Mathematically, we have 


Eleijeeiju) = po”; — E(Cijxestu) = 0 


We now consider the effects of this model on the most important treat- 
ment comparisons. The main effects of A are calculated entirely from 
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unit totals or means. With two sub-units per unit, the error variance of 
a unit total is 


EGig, + ej)? = E(@ijr?) + E(eejn”) + 2E (csjreij2) = 0? + 0? + 2po? 
= 207(1 + p) 


The factor 2 may be regarded as representing the effect of adding over 
2 sub-units. Consequently, for the main effects of A, the appropriate 
error variance per sub-unit is o7(1 +p). If there are 8 sub-units per 
unit, the corresponding variance works out at o7[1 + (8 — 1)p]. 

The main effects of B, on the other hand, are derived from the dif- 
ference between the two sub-units in a unit. For this we have 


Ej, — e:j2)? = 207(1 — p) 


Thus the effective variance per sub-unit applicable to the main effects 
of B is o°(1 — p). This expression remains unchanged when there are 
6 sub-units per unit. This variance also applies to any component of the 
AB interaction, since such components involve comparisons of (b; — bo) 
at different levels of A. 

For other treatment comparisons, the basic error variance may be 
different from either of the two expressions above. Consider, for in- 
stance, (aso — aybo), a comparison of a3 with a; at the zero level of B. 
In any replication the 2 sub-units involved are in different units, and are 
therefore independent. The variance of their difference is therefore 
20”, and the basic variance per sub-unit is 0”. However, the appropriate 
variance for all other comparisons of this type can be derived from the 
basie variances o7[1 + (6 — 1)p] and o7(1 — p). As will be seen, the 
analysis of variance gives unbiased estimates of these two variances, 
and from these, unbiased estimates of any particular variance can be 
obtained. 

In practice, p is nearly always positive. The result is that the main 
effects of A (the factor applied to the units) are less precisely estimated 
than those of B or than the AB interaction. 

The analysis of variance is fairly easy. For the example in table 7.1 we 
first compute the 15 plot totals. Their sum of squares of deviations is 
partitioned in the usual way into 2 d.f. for replications, 4 for the main 
effects of A, and 8 for the experimental error applicable to a whole-plot. 
The mean square for the latter is an unbiased estimate of o7(1 + p). 
All computations are divided by 2 to convert them to a sub-unit basis. 

Next take the difference (b; — bg) on every unit. The 15 differences 
provide 1 d.f. which represents the main effect of B, 4 d.f. which represent 
the AB interactions, and the remaining 10 d.f. whose mean square gives 
an unbiased estimate of the sub-plot error variance o?(1 — p). All sums 


713 COMPARISON WITH RANDOMIZED BLOCKS 221 


of squares are again divided by 2. The complete separation of degrees 
of freedom is shown in table 7.2. 


TABLE 7.2 ANALYSIS OF VARIANCE FOR THE SPLIT-PLOT EXPERIMENT IN 


TABLE 7.1 
df. 
Whole plots 
Replications 2 
A 4 
Whole-plot error 8 
Total 14 
Sub-plots 
B 1 
AB 4 
Sub-plot error 10 . 
Grand total 29 


The point to be noted is that the whole-plot error is computed en- 
tirely from plot totals, and the sub-plot error entirely from the differences 
between sub-plots in the same plot. It will be seen that the grand total 
of the degrees of freedom is 29. As might be expected, the corresponding 
sum of squares is the sum of squares of deviations of the 30 observations 
from their mean, A computation of this quantity provides a check on all 
sums of squares. In practice, particularly with more than 2 sub-unit 
treatments, we usually find the sub-plot error by subtraction. 


7.18 Comparison with Randomized Blocks. The experiment in table 
7.1 might be arranged in ordinary randomized blocks, with 3 blocks of 
10 treatments each. In a consideration of the relative merits of the thvo 
arrangements, the following points are relevant. 

1. With the split-plot design, usually the B and the AB effects are 
estimated more precisely than the A effects. Moreover, the number of 
degrees of freedom available for the experimental error m.s. is smaller 
for whole-unit comparisons than for sub-unit comparisons. 

2. It can be shown that the average experimental error over all treat- 
ment comparisons is the same for both designs. Consequently there isno 
net gain in precision resulting from the use of the split-plot design; the 
increased precision on B and AB is obtained by the sacrifice of precision. 
on A. For tests of significance or the construction of confidence limits 
the randomized block design holds a slight advantage on the average 
since it provides more degrees of freedom for the estimate of the single 
error variance that it requires. For instance, the experiment cited has 
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8 df. for the whole-unit error and 10 d-f. for the sub-unit error, whereas 
randomized blocks would provide 18 d.f. 

3. As we have indicated, the chief practical advantage of the split- 
plot arrangement is that it enables factors that require relatively large 
amounts of material and factors that require only small amounts to be 
combined in the same experiment. If the experiment is planned to in- 
vestigate the first type of factor, so that large amounts of material are 
going to be used anyway, factors of the second type can often be in- 
cluded at very little extra cost, and some additional information obtained 
very cheaply. 

To summarize, the split-plot design is advantageous if the B and AB 
effects are of greater interest than the A effects, or if the A effects cannot 
be tested on small amounts of material. 

Two disadvantages have been mentioned by experimenters. Some- 
times the whole-unit error is much larger than the sub-unit error. It 
may happen that the effects of A, though large and exciting, are not 
significant, whereas those of B, which are too small to be of practical 
interest, are statistically significant. The experimenter tends to be 
uncomfortable in reporting results of this type. Secondly, the fact that 
different treatment comparisons have different basic error variances 
makes the analysis more complex than with randomized blocks, espe- 
cially if some unusual type of comparison is being made. 

When the number of replications and the experimental conditions are 
suitable, the whole units may be arranged ina latin square. A split-plot 
latin square, which eliminates the error variation arising from two types 
of grouping, may be preferable to randomized blocks. Summarizing 22 
field experiments in latin squares where the plots were split into halves, 
Yates (7.1) found a substantial net increase in precision over ran- 
domized blocks. The superiority of the latin square was so pronounced 
that even the whole-plot comparisons would have been less precisely de- 
termined in randomized block designs. Factorial combinations that lend 
themselves to the use of split-plot latin squares are the 5 X 2D 3; 
5X 4,6 X2,6X3,7X2,7*3,8x2. 


7.14 Randomization. The treatments applied to the wnits are ran- 
domized according to the instructions for the design (e.g. randomized 
blocks, latin square, ete.) in which the units are arranged. The treat- 
ments applied to the sub-wnits are allotted at random within each unit. 
A separate randomization is carried out for each unit. 


7.15 Statistical Analysis. If the factor A (applied to the units) con- 
tains @ levels, and the factor B (applied to the sub-units) contains 8 
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levels, the subdivision of degrees of freedom in the analysis of variance is 
shown in table 7.3. The details of the analysis are illustrated in section 
7.17. 

TABLE 7.3 Partition OF DEGREES OF FREEDOM FOR A SPLIT-PLOT DESIGN 


Units arranged in randomized blocks Units arranged in a latin square 


(r replicates) (r = @ replicates) 
Units df. Units df. 
Blocks (r —1) Rows (a — 1) 
A (a — 1) Columns (@ — 1) 
Error (a) (a — 1)(r7 — 1) A (a — 1) 


Error (a) (a — 1)(a — 2) 
Total (ra — 1) a 
Total (a? — 1) 


Sub-units Sub-units 

B (8 — 1) B (6 — 1) 

AB (a — 1)(8 — 1) AB (a — 1)(6 — 1) 

Error (6) a(r — 1)(8 — 1) Error (b) (a — 1)(6 — 1) 
Total ra(s — 1) Total (8 — 1) 


7.16 Standard Errors. Let 2, and HZ, be the error mean squares for 
error (a) and error (b), respectively, on a sub-unit basis. For the treat- 
ment means, also expressed on a sub-unit basis, the standard errors 
shown in table 7.4 apply. The final comparison in table 7.4 contains 


TABLE 7.4 STANDARD ERRORS FOR THE SPLIT-PLOT DESIGN 


Treatment comparison 8.e. 
Difference between two A means: e.g., [(a1) — (ao)] VB a/rB 
Difference between two B means: e.g., [(b1) — (bo)] V 2K, /ra 
Difference between two B means at the same level of A: 
e.g., [(aibi) — (aibo)] V/2Ey/r 
Difference between two A means at the same level of B: 
e.g. [(aaby) — (aobr)] V2((8 — 1) By + Eal/rB 


both the main effect of A and the AB interaction; consequently the ap- 
propriate error is a weighted mean of 2, and 2). This error also applies 
to the difference between two A means which have different levels of B. 
In such cases the ratio of the treatment difference to its standard error 
does not follow the é-distribution. For practical purposes the approx- 
imate rule of section 4.14 may be used, though this method gives slightly 
too few significant results. Let ta, t be the significance levels of ¢ corre- 
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sponding to the degrees of freedom in HZ, and E;, respectively. The 
significance level of ¢ is taken as 
(6 — 1)Eots + Eota 
~"@-DB+E 


For an application, see the next section. 


7.17 Numerical Example. In an experiment on the preparation of 
chocolate cakes, conducted at Iowa State College (7.2), 3 recipes for 
preparing the batter were compared. Recipes I and II differed in that 
the chocolate was added at 40°C. and 60°C., respectively, while recipe 
II contained extra sugar. In addition, 6 different baking temperatures 
were tested: these ranged in 10° steps from 175° to 225°. Bach time that 
a mix was made by any recipe, enough batter was prepared for 6 cakes, 
each of which was baked at a different temperature. Thus the recipes 
are the “whole-unit” treatments, while the baking temperatures are the 
“sub-unit” treatments. There were 15 replications, and it will be as- 
sumed that these were conducted serially according to a randomized 
blocks scheme: that is, one replication was completed before starting 
the next, so that differences among replicates represent time differences. 
(The notes suggest that this was done, though they are not quite ex- 
plicit.) 

A number of measurements were made on the cakes. The measure- 
ment presented here is the breaking angle. One half of a slab of cake is 
held fixed, while the other half is pivoted about the middle until breakage 
occurs. The angle through which the moving half has revolved is read 
on a circular scale. Since breakage is gradual, the reading tends to have 
a subjective element. The data are shown in table 7.5. 

Tt is customary to compute the analysis of variance on a sub-unit 
basis. To avoid confusion, this should be clearly stated in the analysis 
of variance table itself. The calculations may be presented in three 
steps. 

Step 1. Analyze the whole-unit totals by the method appropriate to 
the design in which they are arranged. 


2 
Correction factor: 878) = 278,596 
(269)? + (260)? +... + (155)2 
Total: ) ( v peat) 278,596 = 11,538 
(843)? + (820)? +-..+ (479)2 


Replications: 


18 — 278,596 = 10,204 


2 2 2 
eae (2981) toe + (2844) 


Error (a), by subtraction: 11,538 — 10,204 — 135 = 1,199 


— 278,596 = 135 


TAT 


Recipe I 


Totals 


Recipe II 


Totals 


Recipe III 


Totals 


Temp: totals 


NUMERICAL EXAMPLE 


TABLE 7.5 BREAKING ANGLES (DEGREES) 
Temperature 
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Step 2. This concerns the sub-unit treatments. Their main effects are 
obtained directly. 
(1259)? + (1848)? +---+ (1591)? 


Temperatures: is — 278,596 = 2100 


The sum of squares for interactions between sub-unit and whole-unit 
treatments is found by subtraction. First calculate the total s.s. for the 
two-way table that shows both sets of treatments. 


Total treatments: 


(437)? + (473)? +--+ (616)? + (536)? 
15 


278,596 = 2441 


Then, 
Recipes X temperatures: 2441 — 135 — 2100 = 206 
Step 3. Compute the total s.s. among all sub-units. 
Total: (42)? + (47)? ++--+ (85)? + (25)? — 278,596 = 18,143 
The sum of souares for error (b) is then found by subtraction in table 
7.6. 


TABLE 7.6 ANAtysIs OF VARIANCE OF BREAKING ANGLES (ON A SUB-UNIT 
BASIS) FOR THE EXPERIMENT ON CHOCOLATE CAKES 


Source of variation df. S.8. ms. F 
Replications 4 10,204 
Recipes 2 135 67.5 1.58 
Error (a) 28 1,199 42.8 
Temperatures 5 2,100 420.0 20.49 

Linear regression 1 1,967 1,967.0 95.95 

Deviations 4 133 33.2 1.62 
Recipes X temperatures 10 206 20.6 1.00 
Error (6) 210 4,299 20.5 

Total 269 18,143 


The mean square for recipes, while above that for error (a), does not 
approach the 5% level. Temperature effects are highly significant as 
compared with error (b). Since there is a fairly steady increase in the 
breaking angle with increased temperature, the sum of squares has been 
divided into the component due to a linear regression on temperature 
and that due to deviations from the straight line. The regression coeffi- 
cient amounts to an increase of 1.6° in the breaking angle for each 10° 
rise in baking temperature. The mean square for deviations is not sig- 
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nificant, though it is higher than expectation. There is no indication of 
any interaction. It will be observed that the error (a) m.s. is about twice 
as large as that for error (6). 

In table 7.7 are shown the treatment means, with the principal stand- 
ard errors as obtained from table 7.4. 


TABLE 7.7 BREAKING ANGLE MEANS (DEGREES) 


Recipe 

Recipe 175° 185° 195° 205° 215° 225° means 
I 29.1 31.5 30.8 33.5 38.7 35.1 33.1 

08 26.9 29.4 31.7 32.1 34.5 35.3 31.6 
Il 27.9 28.9 31.7 30.9 34.4 35.7 31.6 


Temp. means 28.0 30.0 31.4 32.2 35.9 35.4 32.1 


Standard error of difference between 


2(42.8) 

Two recipe means: oe = 0.98 (28 d.f.) 
'2(20.5) 

Two temperature means: 4 > ) = 0.95 (210 d.f.) 


1.65 (210 d.f.) 


; 2(20.5) 

Two temperature means for one recipe: iB 
2[5(20.5) + 42.8) 

Two recipe means for a given temperature: ae = 1.80 


The 5% levels of ¢ are 2.05 and 1.97, respectively, for 28 and 210 df. 
Consequently, the 5% level for the last standard error above is 


(5)(20.5)(1.97) + (42.8) (2.05) _ 
(5)(20.5) + (42.8) 
This value always lies between the two individual t-values. In practice 
it need rarely be calculated. 


1.99 


7.18 Missing Data. The formulae for inserting estimates of missing 
values have been developed by Anderson (7.3). Suppose first that the 
observation for a single sub-unit is missing, and that this sub-unit re- 
ceives the treatment combination (a;b;). Let 

U = total for the unit containing the missing observations. 
(A;B;) = total of all sub-units that receive the treatment combination 


(a;b;). 
(A,) = total of all observations that receive the ith level of A. 
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Then the estimate to be inserted for the missing value is 
rU + B(A;B;) — (A;) 
~ @=D6-D 
Example. Suppose that the observation for recipe II, 195° temper- 
ature, in replication 2 is missing. (Its actual value is 47.) Then 
U = 280 — 47 = 233; (A;B;) = 476 — 47 = 429; 
(A;) = 2848 — 47 = 2801 
15(233) + 6(429) — 2801 es 3268 =47 
(14) (5) 70 


By accident, the estimated value is the same as the actual value. If 
several observations are missing, repeated use is made of the formula as 
described in section 4.25. 

In the analysis of variance 1 d.f. is subtracted from the error (6) d.f. 
for each missing observation. An unbiased estimate of Ey is obtained, 
but the treatment sums of squares and that for error (a) are biased up- 
wards. If only a small fraction of the values is missing, it appears that 
these biases can be ignored; methods for obtaining unbiased estimates are 
given by Anderson (7.3). 

Standard errors. When there are missing observations, the formulae 
in table 7.8 are suggested for the differences between 2 means. 


TABLE 7.8 Sranparp ERRORS FoR sPLIT-PLoT EXPERIMENTS WITH 
MISSING DATA 


Treatment comparison 8.e. 
Difference between two A means: V 2[Ea + fEy)/r8 
Difference between two B means: V 2Ev[l + (f8/c)]/re 
Difference between two B means at the same 
level of A: V 2B[L + (f8/e)]/r 
Difference between two A means at the same 
level of B: * VB a/r8) + (2E)[(8 — 1) + f6"]/rB 


If only one value is missing in the experiment, and if a mean con- 
taining that value is compared with another mean, the factor f is 
1/2(r — 1)(8 — 1), and the formulae are exact, reference (7.3). 

With more than 1 missing observation, the value of f depends on the 
locations of the missing observations. The following approximation, 


— 


et Sa ces 


re 
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developed by G. S. Watson, is correct for a number of cases but other- 
wise tends to be slightly too high. 

k 
~ Wr — dB —k+e—1) 
When counting the values of k, c, and d, as defined below, be sure to 


ignore all missing observations except those that occur in the 2 means 
that are being compared. 


f 


k = number of missing observations. 

c = number of replications which contain 1 or more missing observa- 
tions. 

d = number of missing observations in the sub-unit treatment (a,b;) 
that is most affected. 


For example, suppose that in the cake experiment the values for recipe 
I, 175° temperature, are missing in replications 1, 2, and 3, while those 
for recipe I, 185° temperature, are missing in replications 3 and 4. We 
are comparing the means for the 2 temperatures, taken over all recipes. 
Then k is 5, and cis 4. The sub-unit treatment with the most missing 
values is recipe I, 175° temperature, which has 3 missing, so that d is 3. 

The case where a complete unit is missing is discussed by Anderson 
(7.8). 

7.2 Repeated Subdivision 


7.21 Description. In order to include a new factor C at 7 levels, 
the sub-units may each be divided into sub-sub-units. There are three 
experimental error variances. Errors (a) and (b) have the same functions 
as described previously and are calculated in the same manner, except: 
that an extra divisor y is introduced into all sums of squares in order to 
present the analysis on a sub-sub-unit basis. Error (c), which will usually 
be the smallest of the three, applies to the C, AC, BC, and ABC effects. 
The process of subdivision may be carried as far as is convenient. 

The partition of degrees of freedom has already been given (table 7.3) 
for factors A and B. For C and its interactions. the partition is shown in 


table 7.9. 


TABLE 7.9 ANALYSIS OF VARIANCE FoR C AND ITS INTERACTIONS 


Effect df. 

(os (Ce) 

AC (« — I)y — 1) 
BC (8 — I — 1) 
ABC (a — 1)(6 — I(y — 1) 


Error (c) af(r — 1)(y — 1) 
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The computations in the analysis of variance are a straightforward 
extension of those for the simple split-plot design. The sum of squares 
for AC, for instance, is obtained by calculating the total s.s. for the A 
by C two-way table, and subtracting the sums of squares due to A and 
C; and similarly for the other factorial effects in table 7.9. Finally, the 
sum of squares for error (c) is found by calculating the total s.s. among 
sub-sub-units, and subtracting the contributions from all other items in 
the analysis. Worked examples are given in references (7.4) and (7.5). 


7.22 Standard Errors, The formulae in table 7.4 for standard errors 
applicable to A and B effects remain valid apart from division by an ad- 
ditional factor +/7. For the principal comparisons which involve C ef- 
fects, standard errors are shown in table 7.10. All treatment means and 
error mean squares are assumed to be on a sub-sub-unit basis. 


TABLE 7.10 SraNDARD ERRORS FOR THE SPLIT-PLOT DESIGN WITH TWO 


SUBDIVISIONS 
Treatment comparison S.e. 
[(ex) — (co)] ae 
larei) — (ax) = 
rp 
[(ares) — (aoe) ry 
rBy 
(xe) — Cxea] V= 
[@rex) — Goed] p ae aT 
rary 


7.3 Some Variants of the Split-plot Design 


7.31 Systematic Arrangement of the Treatments Applied to the Units. 
The device of subdividing each unit into a number of sub-units is very 
flexible, and can be used with any type of design in which the units are 
arranged. Some of the more useful variations that are possible are de- 
scribed in the remainder of this chapter. The first arises when the nature 
of the experiment makes it necessary or desirable to have the A treat- 
ments arranged in a systematic design. This arrangement has been 
used in experiments where the A treatments were varieties of wheat 
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known to mature at different dates, and the B treatments were prepara- 
tions applied to the seed before sowing. Since each variety was to be 
harvested when ripe, the field operations were made easier by planting 
the varieties in the order in which they would be ready for harvest. The 
field plan is illustrated in table 7.11, where a; is the earliest variety and 
as the latest. 


TABLE 7.11 EXxaMPLe oF SYSTEMATIC ARRANGEMENT OF TREATMENTS 
APPLIED TO UNITS 


ay ay ag a4 as 


bo| |b) |b1] |bo} do 


Rep. I 
by bo bo by by 
4 % a3 45 
bo bo] | br bi] | bo 

Ir 


by by bo bo by 


meee 


The sub-unit treatments are randomized within each unit; the unit 
treatments, however, appear in separate strips. 

This arrangement provides no valid estimate of error for the A main 
effects, or for comparisons such as (4109 — abo) that involve A effects. 
Thus the “whole-plot” analysis of variance is irrelevant. In the sub-plot 
analysis of variance, error (b) is still valid for testing the B effects and 
the AB interactions. One point should be watched. A comparison such 
as (a,b; — abo) is derived entirely from the first strip in the plan. Con- 
sequently, if the sub-plot error variance differs from strip to strip, the 
use of the error (b) m.s. for testing this quantity is not justified; instead, 
it is necessary to compute a separate error using only data from the first 
strip. 

To summarize, systematic placement of the A treatments is advisable 
only where it is essential in order to conduct the experiment and where 
no test of the A main effects is required. 


7.32 Sub-unit Treatments in Strips. In a further variant the sub- 
unit treatments, instead of being randomized independently within each 
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unit, are arranged in strips across each replication. For a 5 X 3 design 
the appropriate rearrangement (after randomization) might be as shown 
in table 7.12. 

This layout may be convenient for field experiments where it is neces- 
sary to test both factors on relatively large areas and to leave free access 


TABLE 7.12 DesiaN with SUB-UNIT TREATMENTS IN STRIPS 


Rep. I 
a3 ay 2) a as 
be 
bo 
by 
Analysis of variance 
Rep. IL df. 
a ay a a2 a3 Replications 2 
A 4 
b Error (a) 8 
B 2 
be Error (b) 4 
AB 8 
bo Error (c) 16 
Total 44 
Rep. III 2 
a a3 a ag a4 
ba 
by 
bo 


at both ends. Similar conditions may apply in other types of experi- 
mentation. 

As with the ordinary split-plot design, the average precision over all 
treatment comparisons is the same as that of randomized blocks arranged 
on the sub-units. The present design sacrifices precision on the main ef- 
fects of A and B in order to provide higher precision on the interactions, 
which will generally be more accurately determined than in either ran- 
domized blocks or the simple split-plot design. Since in addition the 
numbers of degrees of freedom for estimating the errors of the A and B 
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effects are likely to be small, the design is not recommended unless prac- 
tical considerations necessitate its use or unless the interactions are the 
principal object of study. Both the A and B treatments should be ran- 
domized independently within each replication. 


TABLE 7.13 Lavin sQUARE DESIGN WITH SUB-UNIT TREATMENTS IN STRIPS 


a2 ay a a4 ag 


by 
bo 
ba 
a4 a3 a a2 a 
by 
: | 
be Analysis of variance 
df. 
ag ag ag ay a4 Rows 4 
Columns 4 
be A 4 
Error (a) 12 
by B 2 
Error (b) 8 
bo AB 8 
Error (c) 32 
a a a4 ag a Total 74 
by 
be 
bo 
a3 a4 a2 % a 
: heal 
be 
by 
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In the statistical analysis, separate estimates of error are obtained for 
A, B, and AB. Both sets of main effects may be regarded as tested in 
randomized block designs, in which the sum of squares for replications 
(blocks) is the same for both designs. A worked example (on uniformity 
data) is given in reference (7.6). 

Formulae for the estimated standard errors of the differences between 
treatment means per sub-unit are given in table 7.14. The mean squares 
Hq, Ey, and E, are assumed to be on a sub-unit basis. 


TABLE 7.14 STANDARD ERRORS WHEN SUB-UNIT TREATMENTS ARE IN STRIPS 


(a = number of levels of A; 8 = number of levels of B;r = number of replicates) 


Treatment comparison 8.e. 
[(@) — (ao)] V 2E4/r8 
(1) — (bo)] V 2B p/ra 


[(aiby) — (aobr)] V 2[(8 — 1)B. + Eo)/rB 
[(axb1) — (arbo)] V2i(a@ — 1)E. + Ey)/re 


One of the factors may be arranged in a latin square. Witha 5 X 3 
design in 5 replicates, a layout of this type is shown in table 7.13. 

The statistical analysis is straightforward. For the B treatments, the 
“blocks” of the randomized blocks design are the rows of the latin square. 
The formulae given above for the standard errors are applicable (with r 
equal to a). 


7.33 Sub-unit Treatments in a Latin Square. In certain cases the 
sub-unit treatments may be arranged in a latin square, with the prospect 
of a further increase in accuracy on sub-unit comparisons. There are 
two types of layout, of which the first will be described in this section; 
the second, which is used when there are only a few whole-unit treat- 
ments whose main effects are not required with high accuracy, is dis- 
cussed in section 8.6. 

Tn the first type the number of replications must be a multiple of the 
number of sub-unit treatments. For each whole-unit treatment, the 
sub-unit treatments are arranged in one or more separate latin squares. 
An example with 4 whole-unit treatments, 3 sub-unit treatments, and 3 
replications is shown in table 7.15. The latin Squares are not very clearly 
displayed in the plan. If we examine the three units that receive az, it is 
seen that each b appears once in the top sub-unit, once in the middle, 
and once at the bottom. Thus the rows of the latin square are the posi- 
tions of the sub-units within the unit. The only change from the ordi- 
nary split-plot arrangement is that differences among these rows are 
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eliminated from the sub-unit errors. If these differences are large, sub- 
unit comparisons are more precisely determined than in the ordinary de- 
sign. Whole-unit comparisons are unaffected. 

The designs are useful where a fairly consistent gradient is expected 
within each unit. Examples have been quoted previously. In experi- 


TABLE 7.15 Sprir-PLor DESIGN WITH SUB-TREATMENTS IN A LATIN SQUARE 


ag ag ay a 


Reape 1 Ga an Ba eo Analysis of variance 
by by by be Units df. 
Replications 2 
A 3 
SOf epee ee Error (a) 6 
bo bo by hy Total u 
TE [ba) |b} | be be Sub-units 
Fal lea |Eal | Bal Rows 8 
b b b b 
M1 H 0 0 B 2 
AB 6 
a3 a a, CT Error (6) 8 
oa a 
bo} |b2| |ba] [be Total 24 


Ul jh bi} | br bo 


ments on plant viruses, the sub-units may be the top, middle, and lower 
leaves of a plant, with a regular gradient of susceptibility down the 
plant. The sub-unit treatments may be operations performed in suc- 
cession by the same person, or drugs injected in succession to the same 
animal, where a time-trend is anticipated. 

The number of replicates may be any multiple of the number of sub- 
unit treatments. With 6 replicates of the design in table 7.15, the ex- 
perimental material is divided into 2 groups of 3 replicates each. If 
practicable, the division should be made in such a way that the within- 
unit gradient is constant in a given group. Since the rows are eliminated 
separately for each group, changes in the gradient from group to group 
do not increase the error. 

In the randomization the first step is to randomize the units according 
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to the instructions for the design in which they are arranged. For each 
whole-unit treatment a latin square of the size required to accommodate 
the sub-treatments is then randomized as described in section 4.33. A 
new randomization of the latin square is taken for each successive whole- 
unit treatment. The first column of each square is assigned to the first 
replication of the experiment, and so on. 

For extra replication the basic plan is repeated with a new randomiza- 
tion. 

Since the whole-unit analysis of variance is unaffected, it is necessary 
to show only the subdivision of the degrees of freedom among sub-units. 
Table 7.16 applies to the case in which the basic plan is repeated k times. 


TABLE 7.16 Partition OF DEGREES OF FREEDOM AMONG SUB-UNITS WHEN 
SUB-TREATMENTS FORM LATIN SQUARES 


df. 
Rows ko — 1) 
Sub-unit treatments (6 — 1) 
Sub-unit < whole-unit treatments (a2 — 1)(@ — 1) 
Error (6) ae NG = 1.— 8) 
Total ra(B — 1) 


Notation: « = number of whole-unit treatments. 
8 = number of sub-unit, treatments. 
r = number of replications = kp. 


The number of degrees of freedom for rows should be noted. Since 
each latin square supplies (8 — 1) degrees of freedom for rows, every 
whole-unit treatment contributes k(8 — 1) degrees of freedom, so that 
the total is %e(8 — 1) degrees of freedom. The sum of squares for rows 
is calculated by addition of the sums of squares for each of the ka latin 
squares. The sum of squares among the columns of the latin squares 
constitutes a whole-unit comparison and does not appear in the portion 
of the analysis shown above. 

Before using this design it is advisable to verify that the number of 
degrees of freedom for error (b) is adequate. In the example in table 7.15 
there are only 8 d.f., as against 16 with an ordinary split-plot arrange- 
ment. From section 2.31 it appears that the sub-unit error variance 
must be reduced by at least 15% in order to compensate for the decrease 
in degrees of freedom. 


7.84 Split-plot Technique with Confounded Designs. The treat- 
ments applied to the units may constitute a factorial system arranged in 
a design that confounds certain interactions completely or partially. 


7.35 SUB-UNIT TREATMENTS CONFOUNDED 237 


Since experimenters are sometimes uncertain how to compute the analysis 
of variance in such cases, the following notes indicate the method. 

1. The totals for each unit are analyzed by the instructions for the 
confounded design in which the units are arranged. 

2. In computing the sums of squares for the sub-unit treatments and 
for all interactions between sub-unit and whole-unit treatments, the 
confounding is ignored. Even if a whole-unit factorial effect is com- 
pletely confounded, its interactions with sub-unit treatments are un- 
confounded and may be tested by means of error (b). 

3. The sum of squares for error (b) is found as usual by subtracting the 
total s.s. on the whole units plus the total s.s. for sub-unit treatments 
and for interactions between sub-unit and whole-unit treatments from 
the total s.s. for sub-units. 

A difficulty arises occasionally in the presentation of tables. Suppose 
that a3 X 3 X 2 design is applied to the units in blocks of 6 units, with 
AB and ABC partially confounded, and that a factor D at 2 levels is ap- 
plied to the sub-units. In the A by B two-way table, the entries must be 
adjusted to eliminate block effects. The problem is how to present the 
ABD three-way table. The observed means (agbodo) and (agbod;) will 
average to the wnadjusted mean for (apbo), which contains block effects. 
The simplest process is to calculate the difference between the adjusted 
and the unadjusted means of (agbo). This difference is then applied to 
(agbodo) and (aobod;). Notice that this adjustment does not change the 
difference between these two entries and brings their mean to the correct: 
value. ‘ 


7.35 Confounding of Comparisons among Sub-unit Treatments, 
When the sub-unit treatments are factorial, it may be advantageous to 
confound certain interactions among these factors with the units. The 
technique, first presented by Yates (7.7), is illustrated in table 7.17, 
which shows a single replication of the design. The factor A applied to 
the units has 2 levels, while the sub-unit treatments are the 8 combina- 
tions of factors B, C, and D at 2 levels each. The ordinary split-plot ar- 
rangement is shown in (i), while (ii) and (iii) are two alternatives for 
the new method, 

Consider first (i) and (ii). In the ordinary arrangement the unit con- 
tains 8 sub-units, to which the 8 sub-treatments are allotted at random. 
With the new arrangement the sub-units are in groups of 4, chosen so 
that BCD is confounded with group totals. The advantage of the new 
layout.is that the size of the whole-unit has been reduced from 8 to 4 sub- 
units. This will in general lead to more accurate estimates both of the 
A effects and of the sub-unit treatments. The only exceptions are the 
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high-order interactions BCD and ABCD. In the ordinary arrangement 
these interactions are derived from comparisons among sub-units in the 
same unit, whereas in the new design they are estimated from com- 
parisons among the units, as the reader may verify. 

Arrangement (iii) goes a step further. The a and a; treatments which 
have opposite sets of sub-unit treatments are placed in the same sub- 


TABLE 7.17 ConrounpInG oF BCD IN A SPLIT-PLOT EXPERIMENT 


j. Ordinary arrangement ii. New arrangement 
a ay a a ay ay 

ba | a | [bea] b a) {o @] fe 
c cd d (1) bd bed be d 
bed | b bd | be (1) d ed bed 

(1) | be c ed be c bd b 


iii. New arrangement in sub-blocks 


Block Ia Block Ib 
ao ay do Q 
cd () b c 
bd be bed d 
(1) ed d bed 
| be | | bd c jb | 


block. The result is that ABCD is completely confounded between sub- 
blocks, while BCD, like A, is orthogonal with sub-blocks. The aim is to 
sacrifice information on ABCD in the hope of obtaining more precise 
estimates of A and BCD. Consequently, alternative (iii) is usually 
preferable to (ii). 

With 5 replications of arrangement (iii), the partition of degrees of 
freedom is set out in table 7.18 in condensed form, There are 80 sub- 
units and 10 sub-blocks. 


TABLE 7.18 ParrivioN OF DEGREES OF FREEDOM IN A 2 X 2° DESIGN WITH 
SPLIT-PLOT CONFOUNDING 


Units df. Sub-units df. 


Between sub-blocks 9 B, C, D, BC, BD, CD 6 
A 1 AB, AC, AD, ABC, ABD, ACD 6 
BCD 1 Error (6) 48 
Error (a) 8 —_ 


Grand total between sub-units 79 
Total between units 19 
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All sums of squares are calculated in the usual way. Note that ABCD 
does not appear, since it is a component of the sum of squares between 
sub-blocks. 

When the sub-unit treatments form a 2° system, a similar technique 
may be used for any number of levels of the factor A. Single replications 
of the layouts are shown in table 7.19 for A at 2, 3, 4, 5, and 6 levels. 


TABLE 7.19 SpuiT-PLOT DESIGN FOR AN a X 2° FACTORIAL SYSTEM 


Arrangement of whole-units in a single replication 


a = 2;(2 x 2) @ = 313 X 2) 
Ta Ty Ia Tp 
ao | ay’ ay | a ay | a | ae ay’ | ay’ | aa! 
a = 4;(4 X 2°) 
Ta IL 
ao | ay’ | ae | as! ao’ | a | ae’ | as 
a = 5;(5 X 2%) 
Iq Th 
ao | a | a2 | ag | a4 ay! | ay’ | a! | ag!’ | ay! 
a = 6;(6 X 2°) 
Te Ty 
ay | ay’ | ae | ay’ | ag | as’ ao’ | ay | ae’ | ag | a4’ | a5 


Each unit contains 4 sub-units. A prime (’) implies that the sub-unit treatments 
are (1), be, bd, and ed, and an unaceented a that they are b, c, d, and bed. 


The arrangements differ slightly according as the number of levels of 
A is even or odd. With an even level, one component of ABCD is con- 
founded between sub-blocks. For example, with A at 4 levels, this 
component is (ag + a1 — a2 — ao) BCD. When A has an odd number 
of levels, we are forced to confound BCD at least partially with sub- 
blocks, and for simplicity it has been completely confounded. 

The randomization for a single replicate is as follows. (i) Allot the 
sub-blocks a and b at random to the 2 halves into which the replicate has 
been divided. (ii) Allot the levels of A at random to units within each 
sub-block. (iii) Allot the appropriate set of 4 sub-treatments at random 
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within each unit. A separate randomization is used in each replication. 
The partition of degrees of freedom in the analysis of variance is shown 
jn table 7.20 for r replications of the basic design. 


TABLE 7.20 Partirion OF DEGREES OF FREEDOM FOR AN a X 2° spLiy-PLOT 


DESIGN 
Units Sub-units 

a even d.f. a odd df. df. 
Sub-blocks (2r — 1) (2r — 1) Sub-treatments 6 
A (a — 1) (a — 1) A X sub-treatments ~6(a — 1) 
BCD 1 Error (0) 6a(r — 1) 
ABCD (e — 2) («=1) Total Gar 
Error (a) 2(@—1)(r —1) 2a —1)(r — 1) 

Total (2er — 1) (2er — 1) 


When a is odd, the sum of squares for BCD is omitted from the treat- 
ments; otherwise all sums of squares are obtained in the usual way. 
With a even, 1 df. is missing from ABCD, since it is confounded with 
sub-blocks. Since these designs are unlikely to be used unless ABCD is 
small, it will usually be satisfactory to combine the ABCD sum of 
squares with error (a). The pooled sum of squares can be found by sub- 
traction without calculating that for ABCD. 

A considerable number of arrangements of this type are possible, 
though some care is required in their construction in order to avoid 
confounding important comparisons. Finney (7.8) gives an account of 
the principles of construction and designs with worked examples, for a 
4 X 2? and a 6 X 2? factorial, where the units are ina 4 X 4 and 6 X 6 
latin square, respectively. Yates (7.7) gives a similar design for 6 X 2° 
factorial. 
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CHAPTER 8 


FACTORIAL EXPERIMENTS CONFOUNDED IN QUASI-LATIN 
SQUARES 


8.1 Introduction 


Yates (8.1) has constructed a number of confounded arrangements for 
factorial designs in latin squares. In the first group of designs (plans 
8.1-8.7), some of the interactions among the factors are confounded with 
the rows and columns of the squares. These designs resemble ordinary 
latin squares in that the differences among the rows and columns of the 
squares are eliminated from the experimental errors of the unconfounded 
treatment effects; however, the designs do not possess the typical latin 
square property that each treatment appears once in every row and every 
column. Yates (8.1) has proposed the name quasi-latin squares. 

Before one of these plans is used, it is advisable to note the interactions 
that are confounded and the extent to which they are confounded, par- 
ticularly since more treatment effects must be confounded with latin 
squares than with randomized blocks. Of the eight plans shown, only 
one involves the confounding of two-factor interactions. 

In the statistical analysis, the sums of squares for rows, columns, and 
all unconfounded treatment effects are calculated in the usual way. 
Partially confounded effects, if considered negligible, may be combined 
with the error, which is found by subtraction; or they may be tested for 
significance by calculating their sums of squares from those portions of 
the experiment in which the effects are unconfounded. 

In a further group of designs (plans 8.8-8.14), certain treatments are 
applied to complete rows or columns of a latin square. These designs 
have been called half-plaid squares or plaid squares, from the resemblance 
of the plan to a Scotch plaid. 


8.2 Randomization of Quasi-latin Squares 


The rows and the columns of each latin square are rearranged at ran- 
dom. In certain plans (e.g., 8.2) each square contains several replica- 
tions. The randomization must not be restricted in such cases with the 


object of keeping the replications separate. 
241 
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8.3 Notes on the Plans and Statistical Analysis 


2 factorial in a 4 X 4 square. There are a number of alternative de- 
signs, of which two are given in plan 8.1a and 8.1b. In the former, which 
requires 4 replications, 44 relative information is retained on AB, AC, 
BC, and ABC, main effects being unconfounded. On account of the high 
degree of confounding of the two-factor interactions, this plan is not 
recommended unless the factors are presumed to act independently or 
unless the 4 X 4 square is considered much more accurate than alter- 
native designs. 

A numerical example illustrates the general method of analysis for all 
designs presented here. The data in table 8.1 represent twice the yields 


TABLE 8.1 Yre.ps AND STATISTICAL ANALYSIS OF A 2° FACTORIAL IN TWO 
4 X 4 QUASI-LATIN SQUARES 


Square I Square II 
ABC AC AB BC 
c (1) abe ab Totals C; a abe b Totals 
57.9 62.3 60.2 58.6 239.0 62.4 64.6 68.5 63.1 258.6 
AB ABC 
a ac b be (1) ac be ab 
66.2 60.5 60.0 53.4 240.1 61.5 62.7 58.6 69.3 252.1 
bot -UCme CL) a abe ob (1) ac 
64.1 61.3 63.2 59.2 247.8 66.0 56.5 63.5 67.3 253.3 
BC AC 
b ab ac c ab be a C 
58.4 61.8 67.0 61.9 249.1 65.9 52.6 58.1 58.1 234.7 
Totals 246.6 245.9 250.4 233.1 976.0 255.8 236.4 248.7 257.8 998.7 
d.f. 8.8. m.s. 
Squares 1 16.10 
Rows 6 100.90 16.82 
Columns 6 112.71 18.78 
A, B,C 3 146.43 
AB 1 18.92 18.92 
AC 1 0.06 0.06 
BC it 0.33 0.33 
ABC i? 8.27 8.27 
Error 11 101.86 9.26 


Total 31 505.58 
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in bushels per acre from a soil-treatment test on soybeans, at Muscatine, 
Iowa. The plots were split for a varietal comparison which has been 
omitted. For convenience, rows and columns were rearranged so that: 
the field plan corresponds to plan 8.1a. The factors were limestone (A), 
phosphorus (B), and potash (C); in each case the lower level implies no 
application of the treatment. 

To obtain the sums of squares for the treatment effects, first calculate 
the factorial effect totals in the usual way. These are shown in the first 
line of table 8.2. For the partially confounded effects, these totals must 


TABLE 8.2 TREATMENT EFFECTS FoR A 2° FACTORIAL IN 4 X 4 
QUASI-LATIN SQUARES 


[A] [B] (C] [AB] [AC] [BC] [ABC] 


Totals +65.3 —181 —9.7 +85.7 +349 -— 81 — 43 
From rows and columns +18.3 +35.9 -—104 + 7.2 
Adjusted totals +174 —10 423 —115 


be adjusted. Now AB is completely confounded with the first 2 rows of 
square I and the first 2 columns of square II. Calculate AB as obtained 
from these rows and columns, i.e., 


239.0 — 240.1 + 255.8 — 236.4 = +18.3 


This figure is placed in the second row of table 8.2, with corresponding 
figures for AC, BC, and ABC. The adjusted totals are found by sub- 
tracting the second from the first row. Of course, these represent just 
the totals for each effect as found from that part of the experiment in 
which the effect is unconfounded. The divisors for the squares are 32 
for main effects and 16 for the other effects. 

The two-way tables of means require adjustment. For the AB table 
the adjustment is 


This value is added (algebraically) to the (1) and (ab) means and sub- 
tracted from the (a) and (b) means. 

Plan 8.1b completely confounds ABC with squares; all other effects 
are unconfounded. The arrangement has the peculiarity that half the 
treatments appear only in square I, while the other half appear only in 
square II. Although this layout avoids the confounding of any two- 
factor interactions, Yates has pointed out that any interaction of the A 
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effect with squares will appear in the analysis as a BC interaction. The 
results are therefore likely to be misleading if the experimental material 
in square I differs in responsiveness from that in square II. Where this 
danger seems likely, Yates suggests that the columns (or rows) of the 
two squares may be interlaced, so that each square comprizes a ran- 
domly chosen half of the experimental material. A numerical example 
of the statistical analysis is given by Yates (8.1, p. 33), using uniformity 
data. 

2* factorial in an 8 X 8 square. Four replications are required for plan 
8.2. ABCD is completely confounded, and threefold replication is ob- 
tained on all three-factor interactions. The degrees of freedom sub- 
divide as follows. 


df. 
Rows 7 
Columns 7 
Treatments 14 
Error 35 


In the computation for ABC, ABD, ACD, or BCD, the appropriate pair 
of rows is deleted. 

2° factorial in an 8 X 8 square. The confounding in plan 8.3 (for 2 
replications) is not completely balanced. Hight of the 10 three-factor 
interactions and four of the 5 four-factor interactions receive only single 
replication. The remaining high-order interactions, ABE, CDE, ABCD, 
and ABCDE, are unconfounded. By a proper assignment of the letters 
a, b, «++ to the factors, any single three-factor interaction may be left 
unconfounded, though not any desired pair of interactions. 

The partition of degrees of freedom is: 


df. 
Rows 7 
Columns 7 
Treatments 31 
Error 18 


Tn the computation of the sum of squares for ABC, ADE, or BCDE, the 
4 rows in the experiment which correspond to the first 4 rews of plan 8.3 
are omitted. The 4 rows or columns to be omitted for any other partially 
confounded comparison are found similarly from plan 8.3. If the high- 
order interaction effects are likely to be negligible, all partially con- 
founded effects and ABCD and ABCDE may be combined with the 
error. This procedure gives 17 d.f. for treatments and 32 for error. 
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28 factorial in an 8 X 8 square. In each square of plan 8.4, 8 three- 
factor and 6 four-factor interactions are completely confounded. If the 
experiment has a single replication and if only the main effects and two- 
factor interactions are isolated for tests of significance, the degrees of 
freedom are allotted as follows. 


df. 
Rows 7 
Columns 7 
Main effects 6 
Two-factor interactions 15 


Error (estimated from high-order interactions) 28 


Any unconfounded three-factor interaction may also be tested. 

When there are several replications in the experiment, the squares 
should be chosen so that the most important three-factor interactions 
are free from confounding. The balanced design, which requires 5 repli- 
cations, gives 3/5 relative information on all three- and four-factor inter- 
actions. 

3° factorial in a 9 X 9 square. Three or six replicates may be used 
(plan 8.5). Four of the 8 df. for ABC are completely confounded in each 
square. The 2 squares together give balanced confounding of ABC, with 
1/2 relative information. 

When square I alone is used, the components of the analysis of vari- 
ance are: 


df. 
Rows 8 
Columns 8 
A, B,C 6 
AB, AC, BC 12 
ABC 4 
Error 42 


The 4 df. for ABC are evaluated as follows. Compute the total yields 
of the 3 groups of 9 treatments which are described as set II (a), (c), and 
(b) in square 2. The sum of squares of deviations of these 3 totals, with 
divisor 27, constitutes 2 of the 4d.f. The remaining 2 are found similarly 
from set IV (a), (b), and (¢) in square 2. 

With both squares, all 8 d.f. for ABC may be computed, sets I and III 
being taken from the experimental data for square II, and sets II and IV 
from the data for square I. 

8* factorial in a 9 X 9 square. Hach square in plan 8.6 provides 1/2 
relative information on ABC, ABD, ACD, and BCD. For an experiment, 
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with a single replication, the simplest subdivision of the degrees of free- 
dom is: 


df. 
Rows 8 
Columns 8 
Main effects 8 
Two-factor interactions 24 


Error (estimated from high-order interactions) 32 


When 2 replications are available, both squares should be used. All 
8 df. for any three-factor interaction may be tested, though each is based 
on only a single replication. The calculation of the sums of squares for 
the three-factor interactions follows the method outlined for the 3* fac- 
torial in blocks of 9 units (section 6.32). Plan 8.6 indicates the replica- 
tion from which any component must be taken. Thus, if ABC is being 
calculated, we use the data from square II for ABC II and ABC IV and 
the data from square I for ABC I and ABC II. 

4X 2 factorial in an 8 X 8 square. Plan 8.7 requires 4 replications. 
The symbols 4’, A”, A’” refer to components of the A effects. 


A’ =a3+a,—0,;—4} A” = a3 — G2 — a) + A; 
Al” = ag — dg + a — a 


No two-factor interaction is confounded. 

In view of the small relative information on ABC (only 1/3 on the 
average), the sum of squares for this term will usually be combined with 
the error. In this event the analysis of variance reads as follows. 


dfi 
Rows if 
Columns 7 
Main effects and two-factor interactions 12 
Error (by subtraction) 37 


8.4 Other Quasi-latin Squares 


Yates (8.1) gives experimental plans which can be used for the follow- 
ing factorial combinations: 3° X 2 (6 X 6 square), 4 X 2 (8.8 
square), and 9? (9 X 9 square). Analogous to the last design are designs 
for 7? (7 X 7 square) and 5” (5 X 5 square). In a general discussion of 
methods for constructing such plans, Rao (8.3) presents designs for a 2s 
factorial in 4 X 4 squares. 

For certain factorial combinations of treatments, the lattice squares 
(chapter 12) and the incomplete latin squares (chapter 13) confound all 
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treatment effects to the same extent. These arrangements may be use- 
ful in cases where the experimenter does not wish to sacrifice information 
on the interactions. 


8.5 Estimation of the Efficiency of Quasi-latin Squares 


In many cases the precision of a quasi-latin square can be compared 
with that of some other design which might have been used. The data 
from the example in table 8.1 will illustrate two types of comparison. 

In this experiment a 2? factorial system applied to plots of soybeans 
was arranged in two 4 X 4 squares, with 1/2 relative information on 
AB, AC, BC, and ABC. As an alternative, the experiment might have 
been laid out in randomized incomplete blocks of 4 plots each, with 
complete confounding of ABC (plan 6.1). Since the columns were more 
compact in shape than the rows, we will suppose that the columns would 
have been chosen as blocks. 

We first estimate the error m.s. that would have been obtained with 
incomplete blocks. It is helpful to construct analyses of variance for the 
2 designs (table 8.3) for the case where there are no treatments. The 


TABLE 8.3 ANALYSES OF VARIANCE (APPLICABLE TO UNIFORMITY DATA) 


Quasi-latin square Incomplete blocks 
df. m.s. df. m.s. 
Columns 6 18.78 Blocks (columns) 6 18.78 
Rows 6 16.82 Intra-blocks 24 (11.15) 
Intra-row-and-column 18 9.26 


mean squares to be inserted in the table are explained in the next para- 
graph. 

With incomplete blocks, the degrees of freedom to be divided between 
treatments and error are the 24 d.f. within columns (3 from each of the 
8 columns). Since the randomization ascribes to treatments a random 
selection from these 24 d.f., the mean square for the 24 d.f. provides an 
unbiased estimate of the error appropriate to the incomplete block de- 
sign. The 24 df. subdivide into 6 representing variations among rows, 
and 18 representing intra-row-and-column variation. If the interactions 
partially confounded with the rows are regarded as negligible, the rows 
m.s. (16.82) from the quasi-latin square analysis provides an estimate of 
the 6 df. The quasi-latin square analysis does not supply information 
on the whole of the remaining 18 d-f., since in the experiment 7 of these 
were ascribed to treatments. By virtue of the randomization for the 
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quasi-latin square, however, the error m.s. (9.26), with 11 d.f., may be 
taken as an estimate of the mean square for the whole 18 d.f. 
Accordingly, an unbiased estimate of the error m.s. for the incomplete 
block design is 
18 X 9.26'+ 6 X 16.82 


24 


11.15 


For the estimation of main effects, the efficiency of the quasi-latin 
square relative to incomplete blocks is estimated as 11.15/9.26, or 120%. 
Owing to the confounding in the quasi-latin square, the relative efficiency 
on the interactions is only 60%. 

The quasi-latin square may also be compared with ordinary ran- 
domized blocks of 8 plots, where each replication is a pair of columns. 
In table 8.1 the column totals are as follows. 


Rep. I Rep. II Rep. II Rep. IV 
246.6 245.9 250.4 233.1 255.8 236.4 248.7 257.8 


With uniformity data, the randomized blocks error contains 28 d.f, 
(7 from each replicate). To obtain this error, the incomplete blocks 
error (24 d.f.) must be pooled with the 4 d.f. which measure differences 
between the 2 columns in each pair. Since the differences between the 
2 column totals in each replication are 0.7, 17.3, 19.4, and 9.1, respec- 
tively, the swm of squares for the 4 df. is 


(0.7)? + (17.3)? + (19.4)? + (9.1)? 
8 


= 94.87 


This gives [24(11.15) + 94.87]/28 = 12.95 as an estimate of the random- 
ized blocks error. The relative efficiency of the quasi-latin square is 
140% on main effects and 70% on interactions. 

The comparisons above require the assumption that treatment effects 
which are confounded with rows or columns are non-existent. By more 
complicated procedures, estimates can be made which avoid those as- 
sumptions. The relative efficiencies quoted above are slightly too high, 
because we neglected the effect of the additional error df. that would 
be available in the alternative designs (section 2.31). 


8.6 Treatments Applied to Complete Rows of a Latin Square 


A simple example (taken from plan 8.8) is shown below. The latin 
square treatments form a 2 X 2 factorial for factors B and C. An extra 
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factor A has been superimposed on the rows. The first 2 rows receive 
the lower level of A, and the last 2 rows the higher level. 


ABC 


© |b ¢ (QQ) be 

le b be (A) 
}+—__———___— A 

a|(Q) be b ¢ 

@ \be (1) cm 20 


This plan might be tried in a case where the extra factor was not suit- 
able for application to individual units. No precise estimate of the ef- 
fect of A would be expected, but some useful information on the inter- 
actions between A and the other factors might be hoped for. However, 
this device has one property that necessitates caution in its use. When- 
ever an extra factor is superimposed on the rows, certain interactions 
between that factor and the latin square treatments are automatically 
confounded with columns, as noted by Yates (8.2). In the plan above, 
the ABC interaction 


(abe) + (a) + (b) + (©) — (ab) — (ac) — (be) — () 


is the difference between the first 2 columns and the last 2 columns. On 
the other hand, AB and AC are free of column effects. In the plans pre- 
sented here, which are due to Yates (8.1), such confounding is confined 
as far as possible to high-order interactions. The reader who constructs 
his own designs should verify, before using them, what interactions have 
been confounded in this way. 

As indicated above, these designs are useful in experiments where (i) 
it is not easy to apply factor A to individual units, and (ii) accurate 
estimates of the interactions of A, but not of its main effects, are wanted. 
For instance, in an experiment on an irrigated crop, we might wish to 
find out how the responses to plant nutrients B and C are affected by the 
level of the water supply. Factor A could then represent a restricted 
and an abundant amount of irrigation water, applied to whole rows of the 
square. 

These half-plaid squares are related both to quasi-latin squares and to 
split-plot designs. The difference from quasi-latin squares is that in the 
latter the confounding is confined to interactions, whereas main effects 
also are confounded in half-plaid squares. The connection with split- 
plot designs is seen when the rows are regarded as the experimental 
units, and the units (or plots) as sub-units. From this point of view the 
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plan above may be considered as a split-plot design with the sub-units 
arranged in latin squares. 

A moderately large number of useful half-plaid squares can be con- 
structed; only five of the simplest are given here. In the notation em- 
ployed, the factorial system for the latin square treatments is placed in 
parentheses (_ ), while the number of treatments imposed on the rows 
js shown before the parentheses. Thus plan 8.8 shows a 2 X (2?) fae- 
torial. 

As with an ordinary latin square, rows and columns must both be per- 
muted at random. The whole-row treatments cannot be arranged in 
separate blocks. 

The statistical analysis may be followed from the partition of the de- 
grees of freedom, which is given in condensed form with each plan. As 
in the split-plot design, there are two errors. The error for the factor A 
applied to the rows is derived from the sum of squares among rows minus 
the sum of squares for A. 

Most of the other treatment effects are unconfounded. Their sums of 
squares are calculated in the usual way and tested against the latin 
square error. In plans 8.9 and 8.10 certain interactions are partially con- 
founded with columns; the calculations for these effects are described 
in the notes below. In the remainder of the plans, some interactions are 
completely confounded with rows or columns and consequently do not 
appear in the treatments s.s. The error s.s. is found by subtraction. 


8.61 Notes on the Plans and Statistical Analysis. 2X (2*) factorial 
ina4 X 4 square (plan 8.8), Atleast two squares (4 replications) should 
be used, 

2X (8 X 2) factorial in a 6 X 6 square (plan 8.9). Each square con- 
tains 3 replicates. BC and ABC are partially confounded with columns, 
the relative information being 8/9 and 5/9, respectively. The sums of 
squares for these effects are calculated by the procedure for the 3 X 2? 
factorial in incomplete blocks of 6 units, where the columns constitute 
the blocks. A numerical example is given by Yates (8.1, p. 58). Col- 
umns Ia --- IIIb are identical, respectively, with Yates’ blocks Ia --- 
IlIb. Notice that the factor applied to the rows is C. 

8 X (8 X 2) factorial in a 6 X 6 square (plan 8.10). Each square re- 
quires 2 replicates, while the 2 squares comprize a balanced set. Col- 
umn effects are partially confounded with AB and ABC, the average 
relative information being 7/8 and 5/8, respectively. If the columns are 
regarded as incomplete blocks, the numerical example in section 6.19 
may be followed for the computation of the sums of squares for AB and 
ABC. 
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The statistical analyses for the 2 X (2°) factorial (plan 8.11) and the 
2 X (2%) factorial (plan 8.12) in 8 X 8 squares are straightforward. In 
the latter, the BCDE and ABCDE interactions have been combined 
with the error for A. 

Other designs. Yates (8.1, p. 79) presents the plans for a 4 X (23) fae- 
torial in an 8 X 8 square and (8.1, p. 80) fora 3 X (3°) factorial in a 
9 X 9 square. Plans can also be constructed for a 2 X (2°) system in a 
8 X 8 square and fora 3 X (3?) system in a9 X 9 square. 


8.7 Treatments Applied to Complete Rows and Columns of a Latin 
Square 


In these designs one set of treatments A is applied to complete rows 
of the latin square and another set B to complete columns. Although 
relatively low precision is obtained on the row and column treatments, 
the latin square treatments and their more important interactions with 
the A and B treatments are subject only to the latin square error. The 
number of useful designs appears to be limited. 

Two examples are shown in plans 8.13 and 8.14. The latin square 
treatments form a 2? and 2* system, respectively; the additional treat- 
ments are both at 2 levels. No two-factor interactions are confounded. 
The rows and columns of each square should be completely randomized. 

The subdivision of degrees of freedom is indicated on each plan. In- 
teractions which are confounded with rows and columns must be omitted 
from the treatments s.s.: all other treatment effects are calculated in the 
usual way. In plan 8.14, where the square contains a single replication, 
the only available estimate of error comes from the high-order inter- 
actions, unless two or more squares are used. 

A 9 X 9 square for a 3” system of treatments, with row and column 
treatments also at 3 levels each, is given in reference (8.1), p. 81. 
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Plan 8.1b 


Plan 8.2 
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QUASI-LATIN SQUARES 


PLANS 
2° factorial in two 4 X 4 quasi-latin squares 
Square I Square IT 
BC AC AB BC 
(1) | abe ab c a |abe b 
ac |b be ae (1) ac] be ab aoe 
be | (lI) a abe b | (1) ac 
ab }ac oc” Bo ab bela c kG 
23 factorial in two 4 X 4 latin squares 
ABC completely confounded with squares 
Square I Square II 
(1) be ac ab b a abe 
ac ab (1) be abe c ae 
be (1) ab ac Gen DG abe 
ab ac be (1) c abe bo oa 
24 factorial in an 8 X 8 quasi-latin square 
ABCD 
c abcd b ad a bd abe cd 
abd (1) bed be aod ac d ab ABC 
d tbe a abed b cd abd ac 
bod ad acd bd abe ab ¢ (1) “BP 
a b@ ec ab d_= abcd acd be 
abe ac abd cd bed (1) b ag ACP 
b aby d (ae. ¢ ad bed = abed 
acd cd abe (1) abd bc a tg BOD 
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Plan 8.8 2° factorial in an 8 X 8 quasi-latin square 


ACE, BCD, ABDE ACD, BDE, ABCE 


(1) abe be ace abd acd  Obede de 
A ABC, ADE, BCDE 


bee ae e ab bed abde acde 
cde abcd bde ad abce b c 
bd ade cd abcde|be ce abe a 


abe ce acde bed (1) bde ad abe 


acd ede abce c ade ab e bd 
abde d a be ede be ace abed ABD, BCE, ACDE 


ae b abd de ac abede cd bee 


Plan 8.4 2° factorial in 8 X 8 quasi-latin squares 


Square I 
Columns confounded with Rows con- 
CDE, BDF, ABE, ACF, BCEF, ABCD, ADEF founded with 


abedef cef bf bde ae abe adf- ced 

cde abee a adef bef cf bd abedf se 
bdf af abcef abed ¢ be cdef ade ADE 
acf bedf def (1) abd acde abef bee CEF 
be ace abde abf df bedef e acef ACDE 
ef abdef acdf ace tbede d bef ab BODE 
ad b ce cdf  abef aef  abede hdef ABEF 


abe de bed beef acdef abdf ac #f 


Square IT 

ABC, CDE, ADF, BEF, ABDE, BCDF, ACEF 
abcdef by ed) bee abd ac aef de 
ea aa : def acf abdf abcde beef see 
ede a abed acef df bef be abdef ABD 
adf ip bdef (1) acde abe abef bed ABP 
abe le ce bedf abef acdef ad ff BDEF 
bd abce ade abf  bedef ¢ c acdf ACDE 
bef abedf af abde be edef ace ABCE 
of adef abcef acd e@ bcde bd ab 

Square III 

ABP, CDF, ADE, BCE, ABCD, BDEF, ACEF 
abcdef be de; be; abd af ace cd 
rege a ob TOE! Mere iaieelas mast atest ate, 
cdf a abdf acef de bef be abcde BCR 
ade cof bede (1) acdf abc  abef  bdf ACE 
abf bed of bdef abce acdef ad € ABEF 
bd abcf acd abe bedef ce f adef ACDE 
bee abdef ae abcd bf d cdef act BCDE 
of acde abcef adf c¢ bedf bde ab 
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Plan 8.4. (Continued) 
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2° factorial in 8 X 8 quasi-latin squares 


Square IV 
BCD, ACE, ABF, DEF, ABDE, ACDF, BCEF 
ab acd ce bde bef = df aef _abcdef 
de bee abcd a acdef abef bdf cf 
c bd abe acde af abedf beef def 
bef cdef acf  abdf abce ade (1) bed 
adf abcf bedef ef cd b abde ace 
acef abdef bf cdf e bede abe ad 
abcde ae be bdef cef acdf abf 
bedf =f adef abcef abd ac cde be 
Square V 
ACF, BCD, ADE, BEF, ABDF, CDEF, ABCE 
: abedef bdf be acd de cef abe af 
ce a acdf bef  abef abcde def bd 
bed abde abcef cdef adf ac e 
bey abef abd (1) ace adef bede cdf 
ade ed f abdf bedef be acef abe 
ab bee bdef aef d abedf acde 
acf ef cde abce b bedf ad abdef 
df acdef ae bde abcd abf c beef 
Plan 8.5 3° factorial in 9 X 9 quasi-latin squares 
Square I 
ABC I confounded 
Set III (a) 000 102 201 O11 110 212 022 121 220 
(c) 101 200 002 112 211 010 120 222 021 
(6) 202 001 100 210 O12 111 221 020 122 
(a) O11 110 212 022 121 220 000 102 201 
(c) 112 211 010 120 222 021 101 200 002 
(ot) 210 012 111 221 020 122 202 001 100 
(a) 022 121 220 000 102 201 O11 110 212 
(©) 120 222 021 101 200 002 112 211 010 
(b) 221 020 122 202 001 100 210 012 111 
Stl @ ®© © @ ® © @ ® © 


AEF 
BDE 
ACD 
BCF 
ABDF 
CDEF 
ABCE 


ABC Ill 
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Plan 8.5 (Continued) 3° factorial in 9 X 9 quasi-latin squares 
Square IT 
ABC IL 
Set IV (a) 000 101 202 012 110 211 021 122 220 
(b) 102 200 001 111 212 010 120 221 022 
(c) 201 002 100 210 O11 112 222 020 121 
(a) 012 110 211 021 122 220 000 101 202 
(b) 111 212 010 120 221 022 102 200 001 ABCIV 
( 210 O11 112 222 020 121 201 002 100 
(a) 021 122 220 000 101 202 012 110 211 
(b) 120 221 022 102 200 O01 111 212 010 
(c) 222 020 121 201 002 100 210 O11 112 
sti @ © ® @ © ®) @ © 


Plan 8.6 3! factorial in 9 X 9 quasi-latin squares 
Square I 
Columns confounded with ABC II, ACD II, ABD IV, 
BCD IV Rows confounded with 
0000 1011 2022 0121 1102 2110. 0212 1220 2201 ABC IV 
1021 2002 0010 1112 2120 0101 1200 2211 0222 ABDI 
2012 0020 1001 2100 Ol11 1122 2221 0202 1210 ae 
0122 1100 2111 0210 1221 2202 0001 1012 2020 
1110 2121 0102 1201 2212 0220 1022 2000 0011 
3101 0112 1120 2222 0200 1211 2010 0021 1002 
D211 1222 2200 0002 1010 2021 0120 1101 2112 
| 1202 2210 0221 1020 2001 0012 1111 2122 0100 
9220 0201 1212 2011 0022 1000 2102 0110 1121 
Square IL 
Columns confounded with ABC I, ACD I, ABD IIl, 
BCD Il Rows confounded with 
9000 1022 2011 0112 1101 2120 0221 1210 2202 ABC IIL 
012 2001 0020 1121 2110 0102 1200 2222 0211 ACD IV 
30210010 1002 2100 0122 i111 2212 0201 1220 ps 


Qil1 1100 2122 0220 1212 2201 0002 1021 2010 
1120 2112 0101 1202 2221 0210 1011 2000 0022 
2102 0121 1110 2211 0200 1222 2020 0012 1001 
0222 1211 2200 0001 1020 2012 0110 1102 2121 
{201 2220 0212 1010 2002 0021 1122 2111 0100 
2210 0202 1221 2022 0011 1000 2101 0120 1112 
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Plan 8.7 4 X 2? factorial in an 8 X 8 quasiatin square 
ABC 
110 311 211 001 200 010 101 300 A'BC 
011 100 000 210 301 201 310 111 
101 210 O11 311 000 300 110 201 A" BC 
310 001 200 O10 211 111 301 100 
200 010 101 300 110 311 211 O01 A'BC. 
301 201 310 111 011 100 000 210 
000 300 110 201 101 210 O11 311 ABC 
211 111 301 100 310 001 200 010 
Plan 8.8 2 X (2”) factorial in a 4 X 4 half-plaid square 
ABC k squares (2k replicates) 
Squares (k — 1) 
-|b ¢ (1) be Rows 3k 
= la. bi be” <1) A 1 
Se Eee Error (8k — 1) 
aj(l) be b c¢ Columns 3k 
ajbe (i) ¢ 6 B, C, BC, AB, AC 5 
Error (9k — 5) 
Total (16k — 1) 
Plan 8.9 2 X (8 X 2) factorial in a 6 X 6 half-plaid square 
BC, ABC partially confounded 
c¢ | ab Rows 5 
¢ 1 
Oy 0k OO} 11 10 21 20) Error 4 
Om ALi =20° 21 00" -O1 Columns 5 
0/20 2 oo OL 10 A, B, AB, AC 7 
Cc BC ay 
Den OOg Ol LO edt 6 205 952 ABC 2! 
1 J LO 2215 20 01, 100) Error 15 
Dye 220" 01) 00 = 1t .10 an 
Total 35 
la Ib Wa Ib Ila Ill 
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| : Plan 8.10 3 X (3 X 2) factorial in a 6 X 6 half-plaid square 
| 
| Square I Square II 
AB, ABC partially confounded AB, ABC partially confounded 
a | be a | be 
0. | 20° 10! 00) <2), ofls ROL 0 |710- 20,00: 11. 21 sor 
0 Ti (OL) 21. 10.) 0020 O21, Ol, Ad, 20). 00 arto: 
1 00> 20% 10, NOL C2 eeEL Fi 1 00" 10)" 120.5 01 “tia 
1 Dy Vt Ole scot OLne OO: 1 Type 21» 10 7910), 2000, 
2 Ot (25 wat OOF 20 Re 10 2) 1220" 00)" “40 521) Olea 
2 10° sO 20 ey Tee Ole ireal: 28\ OLA Ll 21" OO Omen 
Ia Ib Ic Wa Ib Ile Illa Illb Ile IVa IVb IVe 
Square I alone (2 replicates) Both squares (4 replicates) 
Rows 5 Squares 1 
A 2 Rows 10 
Error 3 A 2 
Columns 5 Error 8 
B, C, BC, AC 7 Columns 10 
AB 4 B, C, BC, AC 7 
ABC 4’ AB 4! 
Error 10 ABC 4! 
_ Error 35 
Total 35 — 
Total val 
Plan 8.11 2 X (2%) factorial in an 8 X 8 half-plaid square 
ABCD confounded 
-|() be bd ced b c d_ bed Rows 7 
-|be (1) cd bd d bed b Cc A 1 
-|bd cd (1) be bd d c b Error 6 
-|led bd be (1) c Bb bed d Columns 7 
A B,C,D . 3 
a c d bed (1) bd cd be AB,AC,AD,BC,BD,CD 6 
ale b bed d be cd bd (1) ABC, ABD, ACD, BCD 4 
a|ld bed b c ed (1) be bd Error 86 
albed d c b bd be (1) ed — 
L Total 63 
| 
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2X (2') factorial in an 8 X 8 half-plaid square 
ABD, BCE, ACDE confounded 


(1) bd be cd ce bede be de Rows 

bed ¢ d bee bde e cde b A 1 
ce bede be de (1) bd be cd Error 6 
bde e cde b bed c ad bee Columns 

Main effects 


be cd ce bede be de (1) bd ‘Two-factor interactions 
d b bed c cde bee bde e Three-factor % 

be de (1) bd be cd ce bede Four-factor € 

cde bee bde e d b  dbede Error 


Total 


2X 2 X (2°) factorial in an 8 X 8 plaid square 
B, ACDE, ABCDE confounded 


Be (= sb ob ob BS CRows: 

A 1 
(@) e cd cde|ce c de d Error 6 
ce” ic de d |cd e (1) ede Columns 
ed cde (1) e de d ce oc B 1 
de d ce c |(1) ede cd e Error 6 

Main effects 
e (1) cde cd |e ce d de ‘Two-factor interactions 
ec de d ce je (1) cde ed Three-factor “I 
cde cl e (1)|}d de ec ce _ Four-factor oe 
a@ ce c dej|cde cd e (1) Error 
Total 
2X 2X (24) factorial in an 8 X 8 plaid square 
B, ACD, CEF, ABCD, BCEPF, 
ADEF, ABDEF 

ieee cm ae D8 |b 6: ib Rows 

A 1 
e cde df cf |ced (1) cef def Error 6 
f cdf de ce \cdefef c ad Columns 
cd (1) cef defje cde df cf B 1 
cdefief ci dit \if - edf ‘de> ce Error 6 


Main effects 

df cf e  cde\cef def cd (1) Two-factor interactions 
de ce f cdf|c ad _ cdef ef Error (from high-order 
cef def cd (1)|df cf e cde _ interactions) 

¢ @  cdefef jde ce ff" cdf 


Total 


30 


63 


CHAPTER 9 


BALANCED AND PARTIALLY BALANCED INCOMPLETE 
BLOCK DESIGNS 


9.1 Balanced Designs 


‘As their name implies, these designs, introduced by Yates (9.1), are 
arranged in blocks or groups that are smaller than a complete replica- 
tion, in order to eliminate heterogeneity to a greater extent than is pos- 
sible with randomized blocks and latin squares. In the designs de- 
scribed in chapters 6 to 8, this reduction in the size of block was achieved 
by sacrificing all or part of the information on certain treatment com- 
parisons. The present designs, on the other hand, were developed for 
experiments in plant breeding and selection, where it is desired to make 
all comparisons among pairs of treatments with equal precision. Con- 
sequently, a different method for reducing the size of block is employed. 

The designs may be arranged either in randomized incomplete blocks 
or in quasi-latin squares. They may be balanced or partially balanced. 
The balanced designs will be illustrated first by simple examples of the 
experimental plans. 

Consider the plan in table 9.1 which compares 9 treatments in in- 
complete blocks of 3 experimental units with 4 replications. 

Every pair of treatments will be found to occur once, and only once, 
in the same block. For instance, treatment 1 occupies the same block 


TABLE 9.1 BALANCED DESIGN FOR 9 TREATMENTS IN BLOCKS OF 3 UNITS 


Block Rep. I Rep. IL Rep. II Rep. IV 
(py ely ae (Ay ae (7) 15 9 (40) 1 8 6 
(2) 4 5 6 (5) 2 5 8 (8:) 7 2 6 ay 4 2 9 
@) (REE) () 3 6 9 (9) 4 8 3 (12) 7 5 3 


with treatments 2 and 3 in the first replication, with treatments 4and7 

in the second replication, with treatments 5 and 9 in the third replica- 

tion, and with treatments 6 and 8 in the fourth replication. When the 

results are analyzed by the method of least squares, this property, to 
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which the adjective balanced is applied, ensures that all pairs of treat- 
ments are compared with approximately the same precision, even though 
the differences among blocks may be large. This design belongs to the 
group known as balanced lattices, so-called because the plans are con- 
veniently written down by drawing a square lattice, with the treatment 
numbers at the intersections of the lines. In the balanced lattices, the 
number of treatments must be an exact square while the number of units 
per block is the corresponding square root. 

Balanced designs can be constructed for other numbers of treatments 
and of units per block. The plan in table 9.2 shows 7 treatments ar- 
ranged in blocks of 3 units. 


TABLE 9.2 BALANCED DESIGN FOR 7 TREATMENTS IN BLOCKS OF 3 UNITS 
Block 

(Qj) 12 4 (3) 3 4 6 (5) 1 5 6 @ £3 % 
(2) 2 3 °5 OAs et 6) 2 6 7 


Again every pair of treatments occurs once within some block. In 
this case, however, the blocks cannot be grouped in separate replications, 
since 7 is not divisible by 3. Designs of this type are known as balanced 
incomplete blocks. 

For certain numbers of treatments and units per block, both the types 
above can be laid out in a kind of latin square formation so as to allow 
the elimination of variation arising from two types of grouping. The 
appropriate rearrangement for the first example is shown in table 9.3. 


TABLE 9,3 BALANcEeD DESIGN FOR 9 TREATMENTS IN 4 LATTICE SQUARES 


Rep. I Rep. IL Rep. II Rep. IV 
Columns 
Rows (1) (2) (3) (4) (5) (6) ( (8) (9) (10) (11) (12) 
Oi B38 4) 14 7 (71 6 8 el, Ss 
(2) 4 5 6 (5) 2 5 8 (83) 9 2 4 (D6! Saas 
(jy. (ans) (6) 3 6 9 9) 5 7 3 (G0), sen ee 


Tt may be verified by inspection that every pair of treatments now 
occurs once in the same row and also once in the same column. All 
comparisons between pairs of treatments are of nearly equal precision. 
This design is known as a lattice square. 

The second example is rewritten somewhat differently. 

Every treatment now appears in each of the 3 rows, and every pair of 
treatments appears together once in the same column. Since the plan 
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TABLE 9.4 BALANCED DESIGN FoR 7 TREATMENTS IN AN INCOMPLETE 
LATIN SQUARE 


Columns (Blocks) 
ag 28 ®@ ® © M 


Rows 
Ieee, ENE Site lake 
1D) 9) Sic tee Gere ear 
Q)n ak See ery ede 


above could represent the first 3 rows of a7 X 7 latin square, this type 
of design has been called an incomplete latin square, or alternatively & 
Youden square, after W. J. Youden, who developed these designs for 
greenhouse experiments. 


9.2 Partially Balanced Designs 


Although a balanced design can be constructed with any number of 
treatments and any number of units per block, the minimum number of 
replications is fixed by these 2 variables. In most cases this number is 
too large for the usual conditions of experimentation. In order to allow 
more freedom of choice in the number of replicates, designs which lack 
the complete symmetry of the balanced designs must be used. 

The most useful of such designs are the lattices. These are constructed 
in the same way as balanced lattices except that there are fewer replica- 
tions. The design with 2 replications (e.g., the first 2 replications in 
table 9.1) is called a simple lattice, and that with 3 replications a triple 
lattice. Similarly, with a lattice square, as in table 9.3, we may use less 
than the full number of replicates necessary for balance. For all these 
designs the number of treatments must be a perfect square. 

A further set, called cubic lattices, is useful when the number of treat- 
ments is very large. The number of treatments is the cube of the num- 
ber of units per block, so that a drastic reduction in block size is obtained. 
The number of replicates is 3 or some multiple of 3. 

Partially balanced designs are less suitable than balanced designs in a 
number of ways. The statistical analysis is more complicated. It is 
mainly for this reason that we have not included any partially balanced 
designs corresponding to those in tables 9.2 and 9.4, Further, when the 
variation among blocks (or rows and columns) is large, some pairs of 
treatments are more precisely compared than others, so that several 
standard errors are required for tests of significance. Also, the overall 
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precision of the experiment is decreased by the lack of symmetry. This 
means that under comparable conditions a partially balanced design 
gives a slightly higher standard error per replication than a balanced de- 
sign. These disadvantages are of minor importance in lattice designs, 
which will be found almost as convenient as balanced designs. 


9.3 Basis of the Statistical Analysis 


9.31 Analysis without Recovery of Inter-block Information. To those 
who have previously used only the simplest designs, one feature of the 
incomplete block designs will be unfamiliar. The treatment averages 
must be adjusted in order to secure the full accuracy available in the de- 
sign. That this is necessary can be seen by examining, for example, the 
palanced lattice in table 9.1. In three of the 4 replications, treatments 
1 and 2 are in different blocks. Consequently, if the simple averages of 
the 2 treatments are compared, the effects of differences among blocks are 
eliminated only for the first replication. 

Yates developed two methods of analysis for the designs. The first: 
is an analysis by standard least squares, reference (9.1), sometimes re- 
ferred to as the “intra-block” analysis. Later he showed that com- 
parisons among the block totals also contained information about treat- 
ment effects that can be utilized in the larger experiments. This ap- 
proach is analogous to that in split-plot experiments, where it will be re- 
called that use is made both of comparisons within whole-units (corre- 
sponding to incomplete blocks), and of comparisons among the totals of 
different whole-units. In this section the original method of analysis 
will be outlined for balanced lattice experiments. This method has not 
been entirely superseded by the newer analysis, because it must be used 
with small experiments. The newer analysis is sketched in section 9.32. 

For the original analysis the mathematical model and the assumptions 
are essentially the same as those for previous designs in this book, as 
described in section 3.2. In the balanced lattice there are k? treatments 
in blocks of & units, with (% + 1) replications. Let y;;, be the observa- 
tion for the gth treatment, which we suppose to be in the jth block within 
the ith replication. The model is 


Yiiq = B+ te + Big + Tq + Cis (9.1) 


where y, 7, ij, and 7, represent the effects of the mean, the replicate, 
the incomplete block, and the treatment, respectively, and ¢;jq is the 
intra-block residual or error, assumed to be normally and independently 
distributed with mean zero and variance o,”. As with previous designs, 
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we find estimates of the parameters by minimizing 
D Wiig — m — Di — dij — te)? 
subject to the usual relations 
E pi = 0,7 by: b;; = 0 (for any 2), Dit i— 0 (9.2) 
t J q 


The normal equation for any unknown is obtained as before by equat- 
ing the observed total to the expected total over all units whose equation 
contains the constant. Thus, if 7, denotes the total for all (k + 1) 
units that receive the gth treatment, the normal equation for é, is 


Ty = (e+ Vm t Db + (b+ Yt (9.3) 


the terms in the p; having disappeared in virtue of equations (9.2), since 
any treatment appears in all replicates. 

This equation does not at once give the value of tq, since it still contains 
the unknowns m and b,;. From the normal equation for m, it is easy to 
see that m is the average over the whole experiment. The normal equa- 
tion for any b;; is 

By = km + kp; + hoy + DY ta (9.4) 
where B;; is the observed block total, and the treatments sum is over the 
k treatments that are in the block. If we add these equations for all 
blocks that contain the qth treatment, we obtain 


Bp =k +1)m+khDby+ Do Vite (9.5). 


where B, is the total of all such blocks. 
It is at this point that the structure of the design is important. In 
the blocks that contain ¢,, every other treatment occurs once and only 


once, while f, appears (k + 1) times. Hence 
YY = Et lq + DX (all other t's) = hég + > (all ¢’s) = htg 
from (9.2). 
This simplifies (9.5) to 

By = k(k + 1)m + kD bij + Mtg (9.6) 
We may now eliminate the unwanted term >) bj; from (9.3) and (9.6), 
giving 
kT, — By = tg (9.7) 


a relatively simple solution. In effect, we simply adjust the treatment 
total for the total of the blocks in which it appears. 
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Since ty represents the deviation of the estimated treatment mean from 
the mean for all treatments, it is customary in practice to compute 
(tg + m), which represents the estimated mean itself. A little algebraic 
manipulation from (9.7) shows that 

T, | (kTg—(k+ UB. + Gl 
k+1 (ke + 1) 
where G = k?(k + 1)m is the grand total for the whole experiment. 
Note that 7',/(k + 1) is the simple or unadjusted treatment mean. The 
last term on the right therefore represents the adjustment that is applied 
to the ordinary mean. This result will be used in a later comparison with 
the newer analysis. 

‘A continuation of the analysis shows that the variance of the difference 
between two t, values is 2c,2/k. Tf the experiment had been in ran- 
domized blocks, with the same number of replications, (k + 1), the cor- 
responding variance would have been 207/(k + 1), where co” is the error 
variance for randomized blocks. Hence the new design, with this method 
of analysis, gives a more accurate experiment than randomized blocks if 
and only if 


(9.8) 


tm = 


oe k 
ee 
Cee, ome G 


The quantity k/(k + 1) is called the efficiency factor of the design. Note 
that the incomplete block design would be less accurate than randomized 
blocks if the variation among incomplete blocks were as great as that, 
within complete replications. With the newer method of analysis this 
disadvantage is largely removed. 


9.32 Analysis with Recovery of Inter-block Information. This analysis 
rests on more difficult theory and will be outlined only in part. Formally, 
the mathematical model is exactly the same: 


Yijg = B + as Tq + Bij + Ciiq (9.9) 


However, the additional assumptions are made that the block effects 
;; are normally and independently distributed with zero means and 
variance o)”, and that they are independent of the e;jq. It follows from 
the assumptions that for the difference between two observations in the 
same block, the residual variance is 20.7, since the 8;;’s eancel, whereas 
for the difference between two observations in different blocks, the 
residual variance is 2(c.2 + 04). The reader will note the analogy with 
split-plot experiments (section 7.12). With regard to the justification 
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for the additional assumptions, it can be said that in agricultural field ex- 
periments, for which the designs were first developed, it is usually just 
as reasonable to make the assumptions for the 6;; as for the e;jg. The 
assumptions cannot be taken for granted, and with certain types of data 
we might not wish to make them. 

‘Asa result of the assumptions, observations that are in the same block 
are positively correlated, so that the simplest type of least squares 
estimation cannot be applied. Instead, we may write down the joint 
frequency distribution of all the observations, and use the more general 
method of estimation known as maximum likelihood. By a well-known 
device in theory, maximum likelihood estimation may be shown to be 
equivalent to the minimization of a weighted sum of squares, consisting 
of two parts. The first part is the sum of squares of deviations of the 
residuals from their block means; the second is the sum of squares of the 
residuals of the block totals. The two parts receive weights w and w’/h, 


respectively, where 
1 1 


w! 


oe oe + hoy? 


For the moment we suppose that w and w’ are known. For the balanced 
lattice, the quantity to be minimized is 


w' 
D whysie — vis — le — HP + Be [Biz — km — kepy — hetiy.P- 
(9.10) 


where y;;.,ti3. denote, respectively, the observed mean of a block and the 
mean of the ?’s that occur in the block. The first sum is over all observa- 
tions, the second over all blocks. 

When we differentiate with respect to a given t,, we must note that the 
term ¢;;, will contain tq, with coefficient 1/k, whenever the block in ques- 
tion contains the gth treatment. The derivative, when equated to zero 


(omitting a factor —2), gives 


w 
Dd wlyiia — vis — ta + bp + Ly Bi — km — kp; — kts] = 0 
(9.11) 


The first sum is over the observations that receive the gth treatment; 
the second over the blocks that contain this treatment. The derivative 
contains an additional part arising from the terms in ¢;;, in the first 
bracket of (9.10), but this contribution will be found to vanish. 

The next step is to simplify (9.11). Both brackets involve a sum of the 
terms ¢;;.- In each case, because of the symmetry of the balanced lattice, 
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the sum contains all other ¢’s once, and t, ( + 1) times. Hence 


Ty (k + 1)tq + (vest of t’s) s" ktq 


k k 
since the total of all ’’s is zero. Further, we may note that the observed 
block mean y;;, equals B,;/k. Writing as before 7’, for the treatment 
total and B; for the total of all blocks in which the treatment appears, we 
have from (9.11) 


tq 


B J 
wong — AE e+ a + ta] +9 Ba HE + Dm — Be =i 


(9.12) 
or B, 


(kw + w')tg + (& + 1)w'm = wT, — (w— w’) - (9.18) 


The constant m may be shown to be the mean for the whole experiment, 
or G/k?(k +1). After some rearrangement we may express (9.13) in a 
form comparable with the original estimate given in (9.8), as follows. 


“ A 
et Ts * (w — w')[kT, — (kh + 1)Bi + G] (9.14) 
k+1 k(k + 1)(kw + w’) 

The adjustments are seen to be of the same form in the new as in the 
original analysis. The two are equal when w’ is zero. This occurs when 
oy? is very large; that is, when differences among blocks are great. In 
all other cases the adjustments are smaller with the new than with the 
original analysis. They reduce to zero when w’ = w, which happens 
only if «4? is zero, or in other words if there are no real differences among 
blocks. Thus the new analysis is a generalization of the original anal- 
ysis, reducing to it in the extreme case where there are marked variations 
among incomplete blocks. At the other extreme, when the arrangement 
into blocks has been ineffective, the new analysis makes no adjustments 
for the non-existent block differences, and in fact reduces to an analysis 
by the method for randomized blocks. 

In practice the weights w and w’ are not known. Yates has shown 
that they can be estimated from the analysis of variance, in the same way 
as we estimate two separate errors in a split-plot experiment. The de- 
tails will not be given. The fact that weights are estimated rather than 
exact introduces some additional sampling variation into the adjust- 
ments. This is unimportant in the larger experiments, but in certain 
of the smaller experiments the weights cannot be estimated accurately 
and the original analysis is recommended. For other accounts of the 
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analysis by different approaches, see references (9.2), (9.3), (9.4), and 
(9.5). 


9.4 Comparison of Incomplete Block and Randomized Block Designs 


So far as the experimental operations are concerned, incomplete block 
designs are no more difficult than randomized blocks. Some extra 
planning is involved in drawing up and randomizing the experimental 
plan, especially if care is taken to make the best possible grouping of the 
experimental units. According to the computing equipment and ex- 
perience available, the time required for the statistical analysis may ex- 
ceed that for randomized blocks by 20 to 150%. 

The gain in accuracy over randomized blocks depends on the type of 
experimental material and may be expected to increase as the number of 
treatments is increased. Most of the incomplete block designs cover the 
range from 6 to 200 treatments, while the cubic lattices extend this range 
to 1000 treatments without requiring a large incomplete block. The 
number of treatments for which a substantial increase in accuracy is at- 
tained must be determined by experience. If the experimental material 
is highly variable and yet lends itself to the formation of small groups 
which are homogeneous, the designs may be advantageous even with 
small numbers of treatments. From the results of varietal trials a num- 
ber of comparisons with randomized blocks have been made: see, for ex- 
ample, references (9.6), (9.7), and (9.8). The experiments varied in size 
from the 3 X 3 to the 11 X 11, and included both lattices and lattice 
squares. They indicated an average gain in accuracy of the order of 
25%. This means that 4 replications of an incomplete block design were 
about as accurate as 5 replications of randomized blocks. 

The ease with which the number of replications can be increased is 
also a factor. The object in the new designs is to obtain the most ac- 
curate comparisons that are possible from a given number of experi- 
mental units. Accordingly, the designs are likely to be most helpful when 
the amount of experimental material or considerations of cost and labor 
force the experiment to be smaller than is desired. Where the number of 
replicates can be increased without difficulty, the experimenter may pre- 
fer some extra replication of a simpler design which avoids the calcula- 
tion of adjustments. 

There is one important property, possessed by many of the designs, 
which increases their attractiveness relative to randomized blocks. As 
the plans show, the lattice square and the lattice designs are arranged 
in complete replications as well as in incomplete blocks. A few of the 
balanced incomplete block designs can also be grouped into replications. 
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Such designs may be regarded as randomized block designs which have 
additional restrictions within each replication. It has been shown by 
Yates (9.2) that these designs can be analyzed as if they were ordinary ran- 
domized blocks. This implies that the unadjusted treatment means give 
unbiased estimates of the true treatment effects, and that the F- and t- 
tests do not lose their validity. Of course, this analysis will in general 
be less accurate than the complete analysis. 

Thus, if there is any criterion for forming incomplete blocks, an in- 
complete block design is worth a trial in preference to a randomized 
block design which occupies the same set of replications. When the data 
have been collected, the experimenter may choose whether to analyze 
them as randomized blocks or to complete the full analysis, with the 
adjustments for incomplete block variations. In fact, if the variation 
among incomplete blocks is no greater than that within blocks, the com- 
plete statistical analysis reduces automatically to that for randomized 
blocks, as mentioned in section 9.32. 

The analysis as randomized blocks may be useful in experiments where 
several measurements are made on each experimental unit. In the for- 
mation of incomplete blocks the units are usually grouped with regard 
to the most important measurement. For certain other measurements 
from the same experiment, the grouping may be less effective; in such 
cases the “randomized blocks” analysis will be satisfactory, as also with 
measurements of subsidiary interest, where the greatest attainable ac- 
curacy is not required. 

In some types of research an appreciable number of units are likely to 
be injured or destroyed in the course of the experiment, so that they must 
be omitted from the statistical analysis. With incomplete data (except 
where only a few units are missing) laborious computations are required 
to calculate the block (or row and column) adjustments. Consequently, 
in experiments where missing data are of frequent occurrence, incom- 
plete block designs cannot be used to full advantage. Even in this case 
nothing is lost by using an incomplete block design which can be ar- 
ranged in complete replicates. If when the experiment is completed it 
becomes evident that an appreciable number of units must be discarded, 
the experimenter may use the randomized blocks analysis in which the 
extra complication due to missing data is smaller. The same con- 
siderations apply in cases where certain treatments may have to be 
ignored in the final analysis. 


9.5 Comparisons with Other Designs 


Yates (9.1) has discussed three other designs that have been used 
when the number of treatments is large. 
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9.51 Systematic Controls. In agricultural field experiments, a control 
variety is sometimes placed at regular intervals over the site, the ex- 
perimental varieties being arranged in complete blocks. From the yields 
of the control plots a fertility index may be calculated for every plot in 
the experiment. These indices are used to adjust the experimental plot 
yields for local variations in fertility. 

The method of systematic controls is very flexible, since it can be used 
with any number of treatments and any number of replicates. Little 
evidence is available about the increase in accuracy obtained from the 
controls, though it seems probable that the increase is seldom large if the 
extra space occupied by the controls is taken into account. The caleula- 
tion of the best adjustments, as described by Yates (9.1), is rather tedi- 
ous, but if a crude type of adjustment is made most of the potential ad- 
vantages of the method may be lost. 

There may be additional reasons for the presence of extra controls, 
e.g., for their use in an observational scoring of the experimental ma- 
terial. In this connection it should be noted that extra controls can be 
included in an incomplete block design. 


9.52 Random Controls. In another method the treatments are di- 
vided into groups, the grouping remaining the same in all replicates. 
Each group is arranged in a separate randomized block or latin square 
experiment. In order to obtain comparisons between treatments that 
are in different groups, one or more controls are included in each group 
and randomized along with the other treatments. Before comparing 
treatments that are in different groups, we subtract from each treat- 
ment mean the mean of the controls that are in the same group. Thus 
the controls serve to correct for differences in the fertility of the sites on 
which different groups are tested. 

This design allows considerable flexibility in number of treatments and 
amount of replication and is simple to analyze. From theoretical con- 
siderations it is likely to be inferior in accuracy to a comparable in- 
complete block design, if one exists. Moreover the error variance is not 
the same for all types of comparison. If only one control is included, 
the variance is twice as great for the difference between two treatments 
that are in different groups as for two treatments in the same group. 


9.53 Split-plot Designs. This arrangement is a variant of the previous 
design which avoids extra controls. Asin section 9.52, the treatments are 
first divided into a number of groups of equal sizes. Instead of testing 
each group on a separate site, the groups are combined into a single ex- 
periment of the split-plot type. For example, with 25 treatments and 
5 replications, we might first divide the treatments into 5 groups, A, B, 
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C, D, and E. The groups are now regarded as whole-plot treatments, 
and could be arranged ina 5 X 5 latin square on plots 5 times as large as 
the basic plots. Within each group the treatments are arranged at ran- 
dom on the individual plots. 

From the analysis for a split-plot design (section 7.15), it will be seen 
that we can test the difference between two treatments whether they 
are in the same group (i.e., whole-plot) or in different groups. Controls 
are no longer needed. The advantages and disadvantages of this ar- 
rangement are in general similar to those of the “random controls’’: the 
split-plot design is more accurate if the space that would have been al- 
lotted to the controls is utilized. Sometimes the split-plot design is 
really more appropriate than an incomplete block design because the 
treatments divide themselves naturally into groups, and comparisons 
between members of the same group are regarded as more important 
than comparisons between members of different groups. 


9.6 Choice of Incomplete Block Design 


Tables-9.5 (randomized incomplete blocks) and 9.6 (lattice squares 
and incomplete latin squares) form an index to the incomplete block de- 
signs in this book. Each table is arranged by number of treatments (6) 
and number of units per block (k) and shows the numbers of replications 
for which designs are available, with references to the plans. It is hoped 
that the tables provide a rapid means of locating a suitable design, if one 
has been constructed. 

Where the number of treatments and the size of block are fixed in ad- 
vance by the conditions of the experiment, little choice is available to 
the experimenter. In certain types of research, however, both the num- 
ber of treatments and the size of block (or row and column) can be varied 
to some extent without impairing the experiment. 

Where more than one design appears appropriate, a design which can 
be arranged in separate replicates is preferable to one which cannot, and 
a balanced design is preferable to a partially balanced design. These 
recommendations usually narrow the choice to one or two designs. For 
example, with 25 treatments to be compared in 8 replications, we might 
use (see table 9.5) either a balanced incomplete block design in blocks of 
4 units, or a lattice design (partially balanced) in blocks of 5 units. The 
latter design, though not the former, can be laid out in separate repli- 
cates. 

The relative advantages of lattice and lattice square designs can be 
learned only by experience. From the results of a lattice square experi- 
ment we may estimate what the standard error would have been if 
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TABLE 9.5 DESIGNS ARRANGED IN RANDOMIZED INCOMPLETE BLOCKS 


Name Num- Number of Num Num-| Number of 
ber of | De? of] replications, ber of |Det.f| teplications, 
treat- Mare a pee ‘i ae eats Plan 
ss per number er num 
ments, | block, ments, | lock, bers 
k | pb. | bt : k | pbt | bt 
x 2 3 | 11 16 4 |2-- 5 | 10.2 
3 3 ie 6 6 11.27 
5 2 4 11.2 6 9 11.28 
3 6 1d? 10 10 11.29 
4 4 11. 19 3 9 11.30 
6 2 5 11.3 9 9 11.31 
3 5 11.4 10 : 10 11.32 
3 10 11.5 20 4 |2,3 10.11 
4 10 11.6 21 3 10 11.33 
5 5 11. 5 5 11.34 
7 2 6 AM 7 10 11.35 
3 3 PLS 25 4 8 11.36 
4 4 11.8 5 | 2-> 6 10.3 
6 6 11. 9 9 11.37 
8 2 7 1129) 27 3/3 10. 
4 a 11.10 28 4 9 11.38 
7 7 DLs 7 9 11.39 
9 2 Gy |) cil, 30 5 12,3 10.12 
3 2,3 4 10.1 31 6 6 11.40 
4 8 11.11 10 10 11.41 
5 10 11.12 36 6 | 2,3 10.7 
6 8 11.13 37 9 9 11.42 
8 8 11. 41 5 10 11.43 
10 2 9 11.14 42 6 | 2,3 10.13 
3 9 11.15 49 We | Qe 8 10.4 
4 6 11.16 56 7 | 2,3 10.14 
5 9 11.17 57 8 8 11.44 
6 9 11.18 64 4 13 10. 
9 9 16s 64 8 | 2--- 9 10.5 
ll 2 10 ll. 72 8 | 2,3 10.15 
5 5 11.19 73 9 9 11.45 
6 6 11.20 81 9 | 2-5- 10 10.6 
10 10 I, 90 9 | 2,3 10.16 
12 3 2,3 10.10 91 10 10 11.46 
13 3 6 11.21 100 10 | 2,3 10.8 
4 4 11.22 121 1 | 25> 10.(12.7) 
9 9 11.23 125 5 13 10. 
15 3 Gs 11.24 144 12 | 2,3,4 10.9 
7 ff 11.25 169 18 | 2+-> 10.(12.8) 
8 8 11.26 


* Or any multiple of this number. 
+ p.b. = partially balanced designs. 
b. = balanced designs. 


{References 10., 11., denote chaplers. 
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TABLE 9.6 DuEsIGNS ARRANGED IN LATTICE SQUARES AND INCOMPLETE 
LATIN SQUARES 


either of the two types of grouping had not been used. Consequently the 
experimenter may test additional methods of grouping by using lattice 
squares. If the extra grouping turns out to be ineffective, the accuracy is 
only slightly less than that of the corresponding lattice design. The 
statistical computations are, however, more laborious for lattice squares, 
since correction terms must be obtained for both row and column effects. 


Num- Num- 
Num- | ber of oe of Num- | ber of AUER of 
: replications, A replications, 
ber of | units rt Plan ber of | units r* Plan 
treat- | per number icra aes number 
ments, | col. ments, | col 
t and t and 
row | p.b.f | b.t row | p.b.t | b.f 
Sap se5si ae 5 13.16 | 1,5 5 | 13.3 
3,7 7 13.17 11,6 6 13.4 
3,8 iS 13.18 11, 10 10 | 13.¢ 
3,10} 10 13.19 13 | 13,4 4 13.5 
4 | 4,3 (3)§,6| 13.t 13,9 9 | 13.6 | 
4,5 5 13.20 15 | 15,7 7 13.7 
4,7 if 13.21 15,8 8 13.8 
49| 9 13.22 16 | 4,4 5 | 12.2 
5 5,4 4 13.} 16, 6 6 13.9 
5,6 6 13.23 16, 10 10 13.10 
5,9 9 13.24 19 | 19,9 43) 13.11 
6 | 65 5 | 13.¢ 19, 10 10 | 13.12 
6,7 7 13.25 21 | 21,5 5 13.18 
7 7,3 3 13.1 25 5,5 3 12.3 
7,4 4 13.2 31 | 31,6 6 13.14 
7,6 6 13. 37 =| 37,9 9 13.15 
7,8 8 13.26 49 Ak 3 4 12.4 
8 7,8 7 13. 64 8,8 | 3-°- 9 12.5 
9 3,3 (2)§,4) 12.1 81 9,9 3,4 5 12.6 
9 9,8 8 13.t 121 |11,11] 3-- 6 12.7 
10 10, 9 9 13. 169 | 13,13] 3-- 7 12.8 
* Or any multiple of this number. 
{p.b. = partially balanced designs. 
b. = balanced designs. 
{ Constructed from a ¢ X ¢ latin square by omission of last column. 
§(_) Not enough degrees of freedom for error. 
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In general, the lattice squares may be expected to be successful in types 
of experimentation where the latin square has been found superior to 
randomized blocks. 
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CHAPTER 10 


LATTICE AND CUBIC LATTICE DESIGNS 


10.1 Balanced Lattices 


10.11 Description. The number of treatments must be an exact 
square. The size of block is the square root of this number. Incomplete 
blocks are combined in groups to form separate replications. The special 
feature of the balanced lattice, as distinguished from other lattices, is 
that every pair of treatments occurs once in the same incomplete block. 
Consequently, all pairs of treatments are compared with the same degree 
of precision. 

The numbers of replications are rather severely restricted, as well as 
the numbers of treatments. The useful plans are indexed below. Bal- 


BALANCED LATTICES CONTAINED IN PLANS 10.1-10.6 


Number of treatments 9 16 25 49 64 81 
Units per block 3 4 5 G 8 9 
Replicates 4 5 6 8 9 10 


anced lattice designs cannot be constructed for 36 treatments, and none 
has been found for 100 or 144 treatments. 


10.12 Statistical Analysis. The analysis is comparatively easy. It 
will be illustrated by an experiment on the effects of 9 feeding treatments 
on the growth rates of pigs, conducted by the North Carolina Agricul- 
tural Experiment Station. The analysis has been described in more de- 
tail by Comstock et al. (10.1), and the nutritional information obtained 
from the results by Peterson et al. (10.2). 

For pigs of a given breed, previous experience indicated that a con- 
siderable part of the variance in growth rate between animals can be 
ascribed to the litter. Hence the experiment was planned so that litter 
differences would not contribute to the intra-block error. The pigs were 
divided into sets of 3 litter-mates. Two sets of 3 were assigned to each 
block. Within a block, each treatment received one member of each set. 
Thus the experimental unit was composed of 2 pigs each feeding in the 
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same pen. The plan and growth rates are given in table 10.1; the plan 
has been rearranged so that it follows the same pattern as plan 10.1. 


TABLE 10.1 Gans IN WEIGHT (POUNDS PER DAY) FOR A TOTAL OF 2 PIGS 


Rep. I Rep. IT 
Block Totals Block Totals 
ay (2) (3) @ @ (4) (7) 
2.20 1.84 2.18 6.22 Let Obs G20 8 deid' 3.54 
(2) (4) (5) (6) (5) (2) (5) (8) 
2.05 0.85 1.86 4.76 2.26 1.07 1.45 4.78 
(3) @) (8) (9) (6) (8) (6) (9) 
0:78 1.60: 1-76 4.09 2:12" “2503s A488, 5,78 
15.07 14.10 
Rep. I Rep. IV 
m (5) (9) (10) (1) (6) (8) 
1281 1.46" deat 4.08 17 61.57 1,48 4.77 
(8) 2 © @ qa) @) (4) (9) 
1.76 2.16 1.80 5.72 1.50 1.60 1.42 4.52 
(9) (3) (4) (8) a2) 3) (5) (7) 
Ly Abe dads 4.41 2.04 0.93 1.78 4.75 
14.21 14.04 
Treatment totals and adjustment factors 
Adjusted total | Mean per 
ue B W=(@87-4B+G) T+eW unit 
Al 6.97 18.61 +3,89 7.21 1.80 
2 7.36 21.24 —5.46 7.02 1.76 
3 8.05 21.16 —3.07 7.86 1.96 
4 6.42 17.23 +7.76 6.91 1.73 
5 4.01 18.37 —4.03 3.76 0.94 
6 7.62 21.03 —3.84 7.38 1.84 
7 5.46 18.10 +1.40 5.55 1.30 
8 5.61 18.05 +2.05 5.74 1.44 
9 5.92 18.47 +1.30 6.00 1.50 
G = 57.42 172.26 0.00 57.43 


The steps in the analysis are as follows. 


toak X k lattice 


1. Calculate the block totals, the replication totals, 


The algebraic formulae refer 


e in blocks of & units, with r = (& + 1) replicates. 


and the treatment totals 7’, shown under the plan. 


the grand total G, 


2. For each treatment, calculate the total B, for all blocks in which 
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the treatment appears. For treatment 4, this is 
B, = 4.76 + 3.54 + 4.41 + 4.52 = 17.28 


Asa check, the total of the B; values should be k times the total of the 7 
values. 
3. Compute the quantities 


W=kT-(K&K+1B+G 


whose sum should be exactly zero. 

4. The analysis of variance is now obtained. The total s.s. and the 
sums of squares for replications and treatments are found in the usual 
way. The sum of squares for blocks within replications, adjusted for 
treatment effects, is 


DW? _ (8.80)? + (6.46)? + +++ + (1.30)? _ 5 song 
i(k + 1) 108 ; 
5. Calculate the adjustment factor 
_ Gy — B.) _ 0.1776 — 0.073 
A BE, 9K 0.1776 


where H, and E, are the blocks and intra-block m.s., respectively. The 
adjusted treatment total is (7 + »W) as shown in table 10.1. To avoid 
confusion, the adjusted means are shown per unit (total of 2 pigs) al- 
though a mean per pig would be more natural. If Zy is less than E., » 
is taken as zero, and no adjustments are applied to the treatment totals. 


= 0.0628 


TABLE 10.2 ANALYSIS OF VARIANCE FOR TOTAL GROWTH RATE OF 2 PIGS 


df. 
so 
General k=3 8.8. m.s. 

Replications k 3 0.0774 
Treatments (k® — 1) 8 3.2261 
Blocks (adj.) (2 —1) 8 1.4206" 0.1776 
Intra-block error (k—(k2-1) 16 1.2368 0.0773 

Total (B+R—-1) 35 5.9609 


6. For ¢-tests, calculate the effective error m.s. 
E,! = E.(1 + ku) = 0.0773(1 + 3 X 0.0628) = 0.0919 


The purpose of the adjustment factor is to increase LZ, so as to take ac- 
count of sampling errors in the block correction values «4W. The ordi- 
nary rules for the calculation of ¢-tests may now be applied to #.’. Thus 
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the variance of the difference between two adjusted treatment totals is 
2rE.', while that for the difference between two adjusted treatment 
means is 2E,'/r, or 0.0460. The standard error of this difference is 0.214. 
Tf means per pig were taken, the standard error would be 0.107.* 

7. The treatments m.s. in table 10.2 cannot be tested against the 
intra-block error m.s., since the former contains some block effects. For 
an approximate F’-test, calculate the sum of squares of deviations of the 
adjusted treatment totals, which comes to 12.6771. This is divided by 
r or 4 to bring to a single-unit basis, and by (k? — 1) or 8 to obtain the 
mean square, 0.3962. This is tested against the effective error m.s., 
0.0919. The F-ratio, 0.3962/0.0919, or 4.31, has 8 and 16 df. 

8. In order to estimate the precision relative to randomized blocks, 
pool the mean squares for blocks (adjusted) and the intra-block error. 
The result is 0.1107, with 24 d.f., and is an unbiased estimate of the error 
variance that would have been present if the experiment had been ar- 
ranged in randomized blocks. This figure is compared with the effective 
error m.s., 0.0919. The relative precision is 0.1107/0.0919, or 120%. 
This means that 4 replications of the balanced lattice appear to have 
been slightly less accurate than 5 replicates of randomized blocks. 

These designs may be used in factorial experiments where it is desired 
to avoid any sacrifice of replication on the interactions relative to that 
on the main effects. Some of the combinations of levels that may be 
tested in balanced lattices are: 3 X 3,8 X 2,4X 4,4 X2X Dot beX 
5,7 X7,8 X8, 4°, 25,9 X 9, 34. In analyzing a factorial experiment, 
first carry through the procedure described in this section. The sum of 
squares for deviations of the adjusted treatment totals is then divided 
into the sums of squares for main effects and interactions in the usual 
way; a divisor 7 must be introduced in order to convert these sums of 
squares to a single-unit basis. The resulting mean squares are tested 
against the effective error m.s., E,(1 + ku), which has (k — 1)(k? — 1), 
or in this case, 16 d.f. 


10,18 Missing Data. Since lattice designs are often used in large ex- 
periments, involving perhaps several hundred observations, it is not easy 
to ensure that all the observations are accurately made. Even with care- 
ful management of the experiment, there is always a chance that mistakes 
or accidents will affect a few of the observations. Consequently, missing 
data tend to be more common with lattices than with small experiments. 

Methods for the analysis of the results of lattice experiments with in- 
complete data have been developed by Cornish (10.3). As might be ex- 
pected, the computations are lengthy. They become simpler in two 
special cases. The first arises when the incomplete blocks are ineffective, 


* Tf wis taken as zero, use E, as the effective error m.s., rather than the pooled m.s. 
for Ep and E,. This practice is recommended for future designs also. 
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so that the analysis reduces to a “randomized blocks” analysis. In this 
event values are substituted for the missing observations by the formula 
which applies to randomized blocks. The second case occurs at the other 
extreme when the variation among blocks is so large that the inter-block 
information is negligible. Here the correct procedure is to insert values 
for the missing observations by minimizing the intra-block error 
m.s. 

Since the general procedure must reduce to these two special pro- 
cedures in the appropriate cases, it might be anticipated, as Cornish’s 
solution shows, that both the “randomized blocks” and the “‘intra- 
block” estimates for the missing observations are required. Thus we 
have to find two estimates for each missing value. Similarly, two anal- 
yses of variance are necessary, one to obtain the correct value for the 
intra-block error m.s. and one for obtaining the mean square among 
blocks. 

In an attempt to reduce the amount of arithmetic, some investigation 
has been made of the consequences of using a single estimate and a single 
analysis of variance. For this purpose the “‘intra-block” estimate seems 
the better of the two. It gives the correct intra-block error m.s.; the 
block m.s.’is in general slightly too high. It provides an excellent ap- 
proximation when the blocks are effective and is at its worst when blocks 
are ineffective; that is, when a “randomized blocks” analysis should have 
been used. In the latter case it still gives unbiased estimates of the 
treatment means, but they are not as accurate as the estimates obtained 
by the use of the “randomized blocks” formula. The chief defect of the 
“randomized blocks” formula is that when block variation is large it 
tends to give an overestimate of the intra-block error m.s., and some- 
times the bias is substantial. For this reason the “randomized blocks” 
estimate is considered more hazardous for general use, despite its greater 
simplicity. 

The procedure that is given for the incomplete block designs in this 
and succeeding chapters is to insert values for the missing observations 
by means of the “‘intra-block” formula. Thereafter the analysis is con- 
ducted in the usual way for complete data, except that in the analysis 
of variance 1 d.f. is subtracted from the intra-block error for each miss- 
ing observation. This method, or the more accurate Cornish method, 
should be used whenever it is intended to recover inter-block information. 
If many observations are missing, or if it is evident on inspection that 
blocks are relatively ineffective, the experimenter may decide at the start 
to analyze the data as a randomized block experiment. In this case sub- 
stitutes for the missing values should of course be obtained by the 
formula for randomized blocks (section 4.25). 
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The “intra-block” formula for the estimate of a missing observation 
in ak X k balanced lattice with (& + 1) replications is as follows. 
PT +k(k+1)B-R+G—kTy — kB: 
ae hk — bP 
In this formula, 7’, B, and R denote as usual the totals for the treatment, 


block, and replication which contain the missing observation, while G@ is 
the grand total. Also 


(10.1) 


T, = total (over all replications) of all treatments that appear in 
the block which has the missing observation. 

B, = total of all blocks in which the treatment with the missing 
observation appears. 


Example. Suppose that the observation 2.20 for treatment 1 in rep- 
lieate 1 of table 10.1 had been missing. Form the block, replicate, treat- 
ment, and grand totals, just as in the ordinary analysis. It is helpful to 
insert an x for the missing observation and to include this x in all totals 
where it should appear. When the value of x has been found, it can then 
be inserted in all the appropriate places and the data are ready for com- 
puting the analysis of variance. The list of treatment totals appears as 
shown below. 


OCONAEIMPWNE 
rs 
i) 
= 


G = 55.22 +2 


The quantities needed are 
T=477, B=4.02, R=1287, G=5b.22 
Ty = 4.77 + 7.36 + 8.05 = 20.18 
B, = 4.02 + 3.54 + 4.08 + 4.77 = 16.41 
Hence 
(9) (4.77) + (12) (4.02) — 12.87 + 55.22 — (3) (20.18) — (8) (16.41) 
12 


= 1,98 
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The analysis of variance is now computed in the usual way, except 
that the total degrees of freedom are reduced from 35 to 34 and the 
intra-block error degrees of freedom from 16 to 15. The same adjust- 
ment factor » may be used (there is actually a slight change which can 
be ignored). 

When several observations are missing, the values are inserted by the 
method of successive approximation (illustrated in section 4,25). The 
exact formulae for t-tests are complicated. The following approximate 
rule assigns an effective number of replicates to each of two treatments 
A and B whose means are being compared. In any replicate of the ex- 
periment, A is credited with 0 replication if it is absent; with 0 replica- 
tion if A is present but B occurs in the same block and is absent; and with 
1 replication otherwise. The same rule is applied to B. For instance, in 
a t-test of the difference between treatments (1) and (3) in table 10.1, 
treatment (1) is credited with 3 replications and treatment (3) with 3 
replications (it loses 1 replication because in the first replicate of the ex- 
periment treatment (1) is in the same block and is missing). On the 
other hand, in comparing treatment (1) with treatment (4), the effective 
numbers of replications are 3 and 4, respectively. 


10.2 Partially Balanced Lattices 


10.21 Simple Lattices (Two Replicates). For an experiment with 2 
replicates, use the first 2 replicates of the appropriate set shown below 
(plans 10.1-10.6). Notice that designs are available for 36, 100, and 
144 treatments (plans 10.7-10.9) as well as for those numbers of treat- 
ments for which balanced designs are given. Plans for 121 and 169 
treatments can be taken from plans 12.7 and 12.8. 

The asymmetry of the designs is apparent from the plans; thus, with 
9 treatments (plan 10.1) the first treatment appears in the same block 
as treatments (2), (3), (4), and (7), but not in the same block as any of 
the remaining treatments. 

For 9 and 16 treatments, simple lattices are unlikely to be more ac- 
curate than randomized blocks unless the variation among incomplete 
blocks is great compared with that within incomplete blocks. Further, 
the numbers of degrees of freedom for estimating the error are only 4 and 
9, as against 9 and 16, respectively, for randomized blocks. 


10.22 Triple Lattices (Three Replicates). The first 3 replicates of 
plans 10.1-10.9, 12.7, and 12.8 are used. Designs may be obtained for 
all squares from 9 to 169. With 9 treatments the precaution mentioned 
in section 10.21 applies also to the triple lattice. 
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10.23 Four Replicates. These designs may be obtained either (i) by 
duplicating the simple lattice or (ii) by using a quadruple lattice, ie., 
the first 4 replicates of the plans. With 36 or 100 treatments, only the 
first method can be used, since no quadruple lattice exists. The second 
procedure is slightly preferable, because the resulting design comes 
closer to symmetry, but the statistical analysis requires more time. 


10.24 Five Replicates. The first 5 replicates from the balanced set 
are taken. Designs are not included for 9, 36, 100, or 144 treatments. 


10.25 Higher Numbers of Replicates. Balanced designs should be 
used for the following numbers of treatments and replicates: 25, 6; 49, 
8; 64, 9; and 81, 10. For 9 treatments in 8 or 12 replicates, 16 treatments 
in 10 replicates, and 25 treatments in 12 replicates, the plan for the bal- 
anced design should be repeated. 

In other cases the following recommendations are made. 


Sia replicates. Use the triple lattice twice. 

Eight replicates. Use the simple lattice four times or the quadruple 
lattice twice. ; 

Nine replicates. Use the triple lattice three times. 

Ten replicates. Use the simple lattice ‘five times or the quintuple lattice 
twice. 


The designs recommended are not always as fully balanced as they 
might be made. For instance with 6 replicates of 16 treatments the bal- 
anced design (5 replicates) plus | extra replicate gives a more nearly 
symmetrical arrangement, than the triple lattice used twice. Since 
separate computing instructions would be required for such plans, we 
have preferred to adhere to a more uniform system. For this reason no 
account is given of designs with 7 replicates, though their statistical 
analysis presents no great difficulty to the reader who has mastered the 


principles. 


10.26 Arrangement of Experimental Material. In the arrangement of 
a group of experimental material so as to apply one of those designs, the 
most important criterion is that units within the same incomplete block 
be homogeneous. In field trials, for example, if the plots are oblong the 
usual procedure is to have the incomplete blocks as nearly square as 
practicable, the plots extending the whole length of one side of the block. 
Uniformity trial investigations have shown that this layout gives on the 
average the most homogeneous block. In eases where there is detailed 
knowledge of the experimental site, some other method of grouping may 
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be superior. Efforts should be made to keep the experimental technique 
uniform for all units in the same block. Changes in technique that are 
necessary should be made when changing from one block to another. 

When the blocks have been formed, there is a secondary advantage in 
forming replications so that blocks within the same replicate are as sim- 
ilar to one another as the material permits, since this increases the pre- 
cision of inter-block comparisons. If the full statistical analysis is carried 
out, it is more important to have the incomplete blocks homogeneous 
than to have the replications homogeneous. Thus, when the grouping 
of blocks into compact replications is troublesome, this criterion may 
be ignored without much loss of precision. 

On the other hand, it is desirable to have homogeneous replications if 
the data may subsequently be analyzed as a randomized block design. 
As we have pointed out, the randomized block analysis may be needed 
in fields of work where certain experimental units or whole treatments 
are likely to be destroyed during the course of the experiment (e.g., in 
pasture plots subject to winterkill). 


10.27 Randomization. The randomization consists of three steps. 

1. Randomize the blocks separately and independently within each 
replication. 

2. Randomize the treatments separately and independently within 
each block. 

3. Allot the treatments to the treatment numbers at random. 

For methods of randomization, see chapter 15. 

Steps 1 and 2 give each treatment an equal chance of being allotted to 
any experimental unit. These steps correspond to the allotment of 
treatments to units at random in an ordinary randomized block design. 

The function of step 3 is to decide by random choice which groups of 
treatments will form the blocks of the design. If differences among blocks 
are large, the error variance per plot for the mean of a group of treat- 
ments which lie in the same block may be considerably higher than the 
average error variance. This additional randomization ensures that the 
average error variance may be used, in nearly all cases, for comparisons 
among groups of treatments. For further discussion, see reference (10.4). 

When a plan is repeated in order to obtain extra replications, a separate 
randomization must be made for every replicate. 


10.28 Statistical Analysis. An account of the theory, with worked 
examples of the simple and triple lattice, is given in reference (10.5); a 
systematic presentation of the shortest computational methods, with 
worked examples of the simple, quadruple, and balanced lattices in 
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(10.6); and a detailed description of the methods for carrying out the 
computations on I.B.M. (Hollerith) punched-card machines in (10.7). 
The computational methods differ slightly from one reference to another, 
but all are basically the same. 

Table 10.3 shows the plan and yields for a 5 X 5 simple lattice ex- 
periment on soybeans, the treatments being 25 varieties produced in the 


TABLE 10.3 Yieups or A 5 X 5 SIMPLE LATTICE EXPERIMENT ON SOYBEANS 


(Yields are in bushels per acre, minus 30 bu.) 


Rep. I B C nC 
(1) 6 (2) 7 (3) 5 (4) 8 (5) 6] 32 + 61 + 9.5 
(6) 16 (7) 12 (8) 12 (9) 13 (10) 8} 61 - 8 — 13 
(1) 17 (12) 7 (13) 7 (14) 9 (15) 14 | 54 + 48 + 7.5 
(16) 18 (17) 16 (18) 18 (19) 13 (20) 14 74 - 15 — 23 
(21) 14 (22) 15 (23) 11 (24) 14 (25) 14} 68 +17 + 2.7 
289 +103 +16.1 
Rep. II 
(1) 24 (6) 13 (11) 24 (16) 11 (21) 8} 80 = 19 -— 14 
(2) 21 (7) 11 (12) 14 (17) 11 (22) 23 | 80 — 23 — 3.6 
(8) 16 (8) 4 (13) 12 (18) 12 (23) 12 | 56 - 8 - 13 
(4) 17 (9) 10 (14) 30 (19) 9 (24) 23 | 89 — 32 - 5.0 
(5) 15 (10) 15 (15) 22 (20) 16 (25) 19 | 87 — 31 — 4.8 
392 —103 —16.1 
Treatment totals (unadj.) me 
(1) 30 (2) 28 (8) 21 (4) 25 (6) 21 | +9.5 
(6) 29 (7) 23 (8) 16 (9) 23 (10) 238 | —1.3 


(11) 41 (12) 21 (13) 19 (14) 39 (15) 36 | +7.5 
(16) 29 (17) 27 (18) 25 (19) 22 (20) 30 | —2.3 
(21) 22 (22) 38 (23) 23 (24) 37 (25) 33 | +2.7 


uC —1.4 —3.6 —13 —5.0 —4.8 0.0 


Treatment totals (adj.) 


(1) 38.1 (2) 33.9 (8) 29.2 (4) 29.5 (5) 25.7 
(6) 26.3 (7) 18.1 (8) 13.4 (9) 16.7 (10) 16.9 
(11) 47.1 (12) 24.9 (18) 25.2 (14) 41.5 (15) 38.7 
(16) 25.3 (17) 21.1 (18) 21.4 (19) 14.7 (20) 22.9 
(21) 23.3 (22) 37.1 (23) 24.4 (24) 34.7 (25) 30.9 


soybean breeding program of the North Carolina Experiment Station. 
The experiment actually contained 4 replications, obtained by duplicat- 
ing the plan for the simple lattice. The first 2 replications will serve to 
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illustrate the analysis for simple, triple, and quadruple lattices, which all 
follow the same general pattern. In section 10.29 the data from the 
whole experiment will be analyzed in order to indicate the procedure 
when the basic plan is repeated. 

1. Find the block totals B, the replication totals, the treatment totals, 
and the grand total. 

2. For each block calculate 


C = total (over all replicates) of all treatments in the block — rB 
For example, for block 1 in replication 1, 
C = 30 + 28 + 21 + 25 + 21 — (2)(82) = +61 


Find the replicate totals R, of the C values. These replicate totals 
should add to zero. 
3. The analysis of variance is as follows. 


df. 8.8. m.8. 
Replications (r — 1) 1 212.18 
Treatments (unadj.) ( — 1) 24 559.28 
Blocks within replications (adj.) r(k — 1) 8 501.84 62.732, 
Intra-block error (k—1)rk —k—1) 16 218.48 13.662, 
Total (rk? — 1) 49 1491.78 


All sums of squares are found in the usual way except that for blocks 
(adjusted), which is given by 


LO LRP _ G1)? +.B)? +-++-+ Bl)? (103)? + (103)? 
kr(r — 1) kr(r — 1) 10 50 
= 501.84 


where R, denotes a replication total of the C’s. 
4. The weighting factor used to obtain the adjusted treatment totals is 


_ (Ey—E,) _ (62.73 — 13.66) 
kr — DE (5) (62.73) 


' where Hy and £, are, respectively, the mean squares for blocks and intra- 
block error. If Z; is less than F,, the factor is taken as zero and no ad- 
justments are made for block effects, the experiment being analyzed as if 
in randomized blocks. 

Each C value is now multiplied by u to obtain the block corrections 
uC. These should sum to zero, apart from rounding errors. Each treat- 
ment total is adjusted by applying the appropriate correction for every 


0.1564 
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block in which the treatment appears. For example, the adjusted total 


for treatment 4 is 
25 + 9.5 — 5.0 = 29.5 


In the case of the simple lattice it pays to write the nC values around the 
table of unadjusted treatment totals, putting the values for replication. 
1 in the right-hand column and those for replication 2 in the bottom row. 
Then the adjustments to any treatment total are at the end of the row 
and the bottom of the column in which the treatment lies. 

5. The error variance of the difference between two treatment means 
is slightly smaller for treatments that appear in the same block than for 
those that do not. The formulae are: 


Two treatments in the same block: 


2E. (2)(13.66) 


: +e —-Dsl= 5 {1 + 0.1564] = 15.80 


Two treatments not in same block; 


2H 
= {t+ ral = a ft + (2)(0.1564)] = 17.93 


_2Ee rkp = (2)(13.66) (2)(5)(0.1564) ] _ 
Average: . [1 G+ 5 | 2 [1 + 6 | = 17,22 


The corresponding standard errors are 3.97, 4.23, and 4.15, respec- 
tively. Except with small designs it is sufficient to use the average value, 
4.15, for all tests between pairs of treatments. 

6. The analysis of variance does not supply an F-test of the adjusted 
treatment totals. A test of the unadjusted treatment totals can be ob- 
tained by analyzing the data as if the experiment were in randomized 
blocks. The error m.s. is the pooled mean square for blocks and intra- 
block error, as shown below. 


df. 8.8. m.8. 
‘Treatments (unadj.) (2 — 1) 24 559.28 «28.30 
Error ( —1)(r —1) 24 720.32 30.01 


This test is not fully sensitive, since it is based on unadjusted treatment 
totals, but it will be sufficient in cases where an F-test is of secondary in- . 
terest or where differences among incomplete blocks are small. 

‘An F-test of the adjusted treatment totals requires a little more cal- 
culation. This is most easily done by changing the unadjusted treat- 
ments s.s. (which is already available) so that it may be tested against 
the intra-block error m.s. Calculate B,, the unadjusted sum of squares 
for blocks within replications, which comes to 350.00. If B, is the ad- 
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justed sum of squares for blocks within replications (501.84), we sub- 
tract the following quantity from the treatments s.s. 


Re Gels (F - aa =p a Bie B.| 


= (5)(1)(0.1564) 


2 
Peete (350.00) — (601.84) | = —85.30 
1 + (5)(0.1564) | 
In this case the subtraction term is negative, which is rather unusual. 
We add 85.30 to the unadjusted treatments s.s. The F-test is then com- 
pleted by the following analysis. 


df. 8.8. m.s. 
Treatments 24 644.58 26.86 
Intra-block error 16 218.48 13.66 


The F-ratio, 1.97, lies between the 10 and the 5% levels. The F value 
differs considerably from that obtained by the randomized blocks test, 
as may happen when the adjustment for incomplete block effects results 
in a substantial increase in precision. 

7. The error m.s. in the randomized blocks analysis was found to be 
30.01. To estimate the gain in accuracy over randomized blocks, we 
compare this figure with the effective error variance 

rh [! a oon] 
(k + 1) 6 


The relative accuracy is 30.01/17.22, or 174%. 


E. [: + | = (13.66) = 17.22 


10.29 Statistical Analysis for Repetitions of the Designs. Suppose 
that the basic design contains n replicates (where, e.g.,n = 2 for a simple 
lattice) and that this is repeated p times, so that the total number of 
replications is r = np. For illustration, we present a joint analysis of 
the data in tables 10.3 and 10.4 (pp. 283 and 289), which show 4 repli- 
cations of the 5 X 5 simple lattice. Replication III is a repetition of 
replication I, and IV of II. The field results have been rearranged so 
that treatments follow the same order within corresponding replications. 
Tn terms of our notation, n = 2, p = 2, andr = 4. 

1. Calculate the block totals, the replication totals, the treatment 
totals (shown in table 10.4), and the grand total. 

2. Arrange the block totals in supplementary tables as on the right of 
table 10.4. In these tables, blocks that contain the same set of i treat- 
ments are placed in the same row. Each table has k rows and p columns, 
while the number of tables ism. Form the row and column totals of each 
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table. For each group of similar blocks, compute the quantities 


C = total (over all replicates) of the treatments appearing in the 
group — n(total of the group of blocks) 


Thus for the first block in replications II and IV, which contains treat- 
ments (1), (6), (11), (16), and (21), 


CG =59 +51 +80 + 63 + 58 — 2(157) = —3 


Find the replication totals R, of the C values; these should add to zero. 
3. The analysis of variance is as follows. 


af. 8.8. m.s. 
Replications (r — 1) 3 226.19 
Treatments (unadj.) (k2 — 1) 24 791.24 
Blocks within replications (adj.) r(k — 1) 16 786.00 49.12Hy 
Component (a) np — 1)(k — 1) 8 164.72 
Component (b) n(k — 1) 8 621.28 
Intra-block error (k—U(rk —k—1) 56 761.56 13.60E. 


Total (rk? — 1) 99 2564.99 


The blocks s.s. contains two components, both of which are obtained 
from the supplementary table of block totals. 

Component (a) is a new component, which arises only when the design 
has been repeated. It is composed of the differences between the totals 
of blocks that contain the same set of treatments. Since each supple- 
mentary table has k rows and p columns, it may be analyzed as follows. 


df. 
Rows (k — 1) 
Columns (p — 1) 
Rows X columns (k — 1)(p — 1) ® 
Total (kp — 1) 


The sum of squares for component (a) is the sum of the rows X columns 
interactions over all n tables. It may be obtained from the following 


calculations. 
2 es 2 2 2 9 2. 
fale (82)? + (61)? +: +++ (75)? + (84)? _ (650)" + (749) _ 602.18 


To rau pk? = 50 
2 2 2 
Baws! (104)? + (142)? +---+ Q7)* _ (650)? + (749) = 309.28 
pk = 10 50 
2 2 2 2 24 (749)? 
Columns: 220" + toy + oes + (357)? _ (650) ie eens 


Component (a) s.s. = 602.18 — 309.28 — 128.18 = 164.72 
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The above is a general method that covers all cases. With a simple 
lattice and p = 2, it is quicker to calculate the differences between the 
totals of similar blocks. The sum of squares of deviations of these dif- 
ferences from their replication means, divided by 2k, gives the sum cf 
squares for component (a). 

Component (b) is the component that is present even when there are 


no repetitions. 
ye YS R2 


kr(n— 1) k?r(n — 1) 


4s (63)? + (29)? +---+ (48)? (99)? + (99)? 
- 20 100 


Component (b) s.s, = 


= 621.28 


. The sum of squares for blocks within replications (adjusted) is the pooled 
sum of squares for the two components. All other terms in the analysis 
of variance are computed in the usual way. 

4. If Hy is the pooled mean square for blocks, and H, that for error, 
the weighting factor is 


— P(E — Be) is (2)(35.52) 
K(r — p)Eo + (p — 1)E.] — (5)[(2)(49.12) + 13.60] 


where it will be recalled that p is the number of repetitions of the basic 
design, in this case 2. As usual, no adjustments are made if Ey is less 
than F,. 

The block corrections nC are obtained and entered in table 10.4; they 
should add to zero except for rounding errors. Each treatment total is 
adjusted by applying a correction for each group of blocks in which the 
treatmeng appears. For instance, treatment (13) appears in the third 
block in replications I and II and in the third block in replications II 
and IV. Its adjusted total is 


44 + 8.8 — 0.1 = 52.7 


5. The error variances for the difference between two treatment means 
are: 


= 0.1270 


Two treatments in the same block: 


2E 2)(13.6 
1+ @ = Val = one) ©) 1 + 0.1270) = 7.70 
Two treatments not in same block: 
2E. 2)(13.66 
t+ mal = 208.68) [1 + (2)(0.1270)] = 8.56 


= 8.28 


2B, The (2)(13.66) (2)(5)(0.1270) 
Aversee: & 
Me [) aa 4 [ ES eae | 
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TABLE 10.4 A 5X5 stmere xartice (reps. IIT anp IV) 


Block totals 
Rep. III I Ill Sum C uC 


() 18 (2) 26 «(8) 9 (4) 18 (6) 11] 72 32 72 104 463 +80 
(6) 15 (7) 18 (8) 22 (9) 11 (10) 15| 81 61 81 142 -29 — 3% 
(11) 19 (12) 10 (43) 10 (14) 10 (45) 16] 65 54 65 119 +69 + 88 
(16) 21 (17) 16 (18) 17 (19) 4 (20) 17] 75 74 75 149 —34 — 43 
(21) 15 (22) 12 (28) 138 (24) 20 (25) 8| 68 68 G68 136 +30 + 38 


361 289 361 650 +99 +12.6 


Rep. IV Ir IV Sum C uC 


(1) 16 =) 7 (1) 20 (16) 13 (21) 21) 77 80 77 157 — 3 -— 0.4 
(2) 15 (7) 10 (12) 11 (7) 7 (22) 14] 57 80 57 187 2 0.3 
(3) 7 (8) 11 (43) 15 (18) 15 (23) 16| 64 56 64 1200 —1 —01 
(4) 19 (9) 14 (14) 20 (19) 6 (24) 16] 75 89 75 164 —49 — 6.2 
(5) 17 (10) 18 (15) 20 (20) 15 (25) 14) 84 87 84 Wl —48 — 6.1 


357 392 357 749 -99 —12.5 


Treatment totals (4 reps.) 
(l) 59 (2) - 69 (8) 37 (4) 57 (5) 49 +8.0 
(6) 51 (7) 51 (8) 49 (9) 48 (10) 56 —3.7 
(11) 80 (12) 42 (13) 44 (14) 69 (15) 72 +8.8 
(16) 63 (17) 50 (18) 57 (19) 32 (20) 62 —4.3 
(21) 58 (22) 64 (23) 52 (24) 73 (25) 55 +3.8 
—0.4 +0.3 —0.1 —6.2 —6.1 


Adjusted treatment totals 


(1) 66.6 @) 773 (8) 449 (4) 588 (5) 50.9 
(6) 46.9 (7) 47.6 (8) 45.2 (9) 381 (10) 46.2 
(11) 884 (12) Sl. (43) 527 (14) 71.6 (15) 74.7 
(16) 583 (17) 46.0 (18) 52.6 (19) 21.5 (20) 51.6 
(21) 614 (22) 68.1 (23) 55.7 (24) 70.6 (25) 52.7 


6. An approximate F-test of the adjusted treatment totals is obtained 
by the method deseribed in the previous section, step 6. In this case the 
quantity that must be subtracted from the treatments s.8. is 


ki 1 [ae B| 
Oe her =a aa 


where B,, is the unadjusted and Ba the adjusted sum of squares for com- 
ponent (b) of the blocks. Both quantities have already been computed: 
Ba, 621.28, appears in the analysis of variance, while Bu, 309.28, is the 


rows s.s. used in computing component (a). 
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7. A comparison with randomized blocks is made as in the previous 
section, step 7. The effective error variance of the lattice is taken as 


B. [i+ we | 
(+1) 


10.210 Missing Data. If the missing observations are numerous, an 
analysis by the method for randomized blocks is reeommended. If it is 
desired to carry out the full analysis, missing values are estimated by the 
formula which minimizes the intra-block error s.s., though as explained 
in section 10.13 this method is only approximate. 

Experiments with no repetition of the basic design. 


(r — 1)k?T — rR + G — rkC + kC’ 
r= 
(r — 1)(k — 1) (rk — k — 1) 
Tn this formula 7 and R are the totals for the treatment and replicate 
that contain the missing value, G is the grand total, and C is the C value 


(defined in section 10.28) for the block which contains the missing obser- 
vation. The only unfamiliar quantity is C’: 


(10.2) 


C’ = total of the C values for all blocks which contain the treat- 
ment that has the missing value 


Example. For the experiment analyzed in section 10.28, suppose that 
3 observations are missing: treatments (1) and (12) in replication I and 
TABLE 10.5 Summary Torans ror 4 5 X 5 SIMPLE LATTICE WITH THRE 
MISSING VALUES. 


Treatment totals 


(1) 24+2 (2) 28 (3) 21 (4) 25 (5) 21 
(6) 29 (1) 23 (8) 16 (9) 23 (10) 23 
(11) 41 (12) 14+y (13) 19 (14 9+2 (15) 36 
(16) 29 (17) 27 (18) 25 (19) 22 (20) 30 
(21) 22 (22) 38 (23) 28 (24) 37 (25) 33 
Rep. I Rep. II 
B ( B Cc 
26 +2 67 —« 80 a htie ete 
61 -8 80 —30+y 

47+ y 2 —y+z 56 -8 
74 Shs 59 +2 —2-2 
68 17 87 —31 


Ri =276 +2+y %—2-—y+z Re = 362+4+2 -8+2+y-—2 
G=R +e = 638+2+y+2 
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treatment (14) in replication II. Denote the estimates of the 3 missing 
values by a, y, and z, respectively; these will be obtained by the method 
of successive approximation. 

It is best to start by finding all block, replicate, and treatment totals 
and all C values, as in steps 1 and 2 of the computing instructions for 
this design. Wherever a missing observation is involved, include the ap- 
propriate , y, or z in its place. The summary totals are shown in table 
10.5. Note that the total of the C values over the whole experiment is 
identically zero, so that a good check js available on this part of the cal- 
culations. 

We must now find first approximations for y and z. In the replications 
in which they are present, their values are 14 and 9, respectively. How- 
ever, replication II appears to have higher values than replication I, and 
it is probably worth while to make an adjustment for replication effect. 
If we ignore the missing values, the means per plot are about 12 for 
replication I and 15 for replication II. Since y is missing in replication 
I, we subtract 3 from its value in replication IT, giving 11. For z we add 
3, giving 12 as the first approximation. Using y = 11, z = 12, we now 
solve for x from formula (10.2). For r = 2, k = 5 the formula becomes 

25T —2R +G— 10C + 5C’ 
ao 16 
Put y = 11,2 = 12. From table 10.5 we find for x 
T =24; R= 276411 = 287; G = 638 + 11 + 12 = 661 
C = 67; CG’ = 67 — 15 = 52 
(25) (24) — (2)(287) + 661 — (10) (67) + (5)(52) _ oh 
ie 16 16 
Put x = 17,2 = 12. For y we have 
T=14; R= 276 +17 + 293; G = 638 + 17 + 12 = 667 
C = 25 + 12 = 37; C’ =254+12-30=7 
(25) (14) — (2)(293) + 667 — (10)(37) + (5)(7) es 96 
Wl 16 16 
Put x =17,y =6. Forz we have 
T=9; R= 362; G@=638+17+6= 661; C= —2 
AOS Tei 
(25) (9) — (2)(862) + 661 — (10)(—2) + (5)(17) 267 YW 
aS 16 Te 


17 


=6 
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This completes the first round. A second round leads to x = 18, y = 
5, 2 = 17, and there is obviously no need for further calculation. The 
rest of the analysis proceeds as usual except that 3 df. are omitted from 
the total s.s. and the intra-block error s.s. For approximate t-tests, see 
section 10.13. } 

Experiments with repetitions of the basic design. If the basic design has 
n replicates and these are repeated p times to give r = np replications, 
the formula becomes 


(n — 1)k?T + (n — 1)rR + G — nkC + kC’ — nr?! 

(n — 1)(k — 1)(rk — k — 1) 
All symbols have the same meaning as in (10.2) except that C and C’ are 
now derived from the totals of groups of similar blocks, just as in the 


statistical analysis for this case in section 10.29. The new quantity R’ 
is defined as 


R’ = total of all replications that are similar to the replication con- 
taining the missing value 


(10.3) 


Note that this total includes the replication with the missing value. 

Bxample. Inthe 5 X 5 simple lattice with 4 replications, we have n = 
2,7 =4. If treatment (1) is missing in the first replication, the reader 
may verify that 


T = 58; R= 283; @ = 1393; C= 69; C’=69 -9 = 60 
R! = 283 + 361 = 644 


This gives 
Ws (25) (53) + (4) (283) + 1393 — (10)(69) + (5)(60) — (4)(644) 
56 
844 
=— = 16 
56 


10.3 Rectangular Lattices 


10.31 Description. These designs were developed recently by Harsh- 
barger (10.8) for k(& + 1) treatments in blocks of k units. They form a 
useful addition to the square lattices described in previous sections, since 
the allowable numbers of treatments, 12, 20, 30, 42, 56, 72, etc., fall about 
midway between the allowable numbers for square lattices. The sta- 
tistical analysis is quite similar to that for simple and triple lattices, 
though it takes more time because the block adjustments are not so 
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simple as with square lattices. The new designs are less symmetrical 
than the square lattices, in the sense that there is a greater variation in 
the accuracy with which two treatment means are compared. It will 
be recalled that only two standard errors are required for t-tests with the 
simple and triple lattice—one for two treatments that appear in the same 
block and one for two treatments that do not. The simple rectangular 
lattice requires four standard errors, while the triple rectangular lattice 
requires seven. For practical purposes it appears that these can nearly 
always be reduced to two. 

There are several ways of constructing the designs. One (suggested 
by G. S. Watson) is by means of a latin square with (& + 1) rows and 
columns, in which every letter in the leading diagonal is different. When 
writing down the square we attach a number to all letters except those 
in the leading diagonal, as illustrated below for a 4 X 4 square. 


A Bl C2 D3 
D4 C Bi AG 
BT A& D c9 
C10 Dil Al2 B 


In the first replication we place in a block all numbers that lie in the 
same row of the latin square, in the second replication all numbers that 
lie in the same column, and in the third all numbers that have the same 
latin letter. 


Block Rep. I Rep. II Rep. III 
1 ain, Vea or eae b ary 45 ~ (One He 
2a) td on 6 2 Oke, 2 92) 90; 
3 (Cree) LE ee Ome) 8! 3.0 ay 
4 10 ll 12 @) SAP 10 4 6 8 12 


The important feature of this arrangement is that no two treatments 
are in the same block more than once. The use of a latin square with 
different letters down the leading diagonal ensures that in the third. 
replication the three numbers associated with any letter are all in dif- 
ferent rows and columns, and hence have not previously occurred to- 
gether in a block. 

Plans 10.10-10.16 were constructed by this method. By using the 
first 2 replications from any plan we obtain a rectangular lattice in 2 
replications, which by means of repetitions can be used for an experi- 
ment in 4, 6, 8, ete., replications. By using all 3 replications of the plan 
we have designs for 3, 6, 9, etc., replications. 
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Alternatively (as pointed out by 8S. 8. Shrikhande), we may construct 
the designs from a balanced lattice with (k + 1)? treatments. If the 
first replication is omitted, and if all treatments that appear in any one 
selected block in the first replicate are omitted in subsequent replica- 
tions, it is easy to verify that we generate a design for k(k + 1) treat- 
ments in which every block is of size k and in which no two treatments 
appear together more than once in the same block. This method gives 
any number of replicates up to k, though of course the method fails when 
no balanced lattice exists, as with 36 treatments. The discussion in this 
section is limited to simple and triple rectangular lattices. 

The method of randomizing is the same as for lattices (section 10.27). 


10.32 Statistical Analysis. In each plan, the three basic replications 
are described as the X, Y, and Z replications. Further, if we take any 
block in one replication and examine another replication, we find that 
there is one and only one block in the other replication that has no treat- 
ments in common with the chosen block. These two blocks will be called 
partners. Since it is important to be able to distinguish partners in the 
statistical analysis, each block is denoted in the plan by two symbols, 
e.g., Y2, one to mark the replication and one to indicate the partners. 
All partners carry the same number: thus the partners of Y2 are X2 and 
42, It is advisable to write the block labels in the notebook in which 
the original results are recorded. 

Example. The example is a triple rectangular lattice for 12 treatments 
in blocks of 3 (k = 3). Artificial data were assembled by taking true 
treatment effects as shown in table 10.6, and adding to them true block 
effects as given with each block in table 10.7. Thus the observation 7 


TABLE 10.6 TRwarMENT BFFECTS IN ARTIFICIAL DATA FOR A 3 X 4 
TRIPLE RECTANGULAR LATTICE 


True treatment 


‘Treatment True effect total 
1 14 56 
2 7 35 
3 2 20 
4 0 14 
5 3 23 
6 11 47 
% 9 41 
8 16 62 
9 8 38 

10 1 17 
11 5 29 
12 6 32 
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for treatment (10) in block X4 of table 10.7 is obtained by adding the 
true treatment effect, 1, to the true block effect, 6. Since the average of 
the true block effects comes out to be 56/12, or 14/3, the true treatment 
totals, over the 3 replicates, will be (387 + 14), where 7 is the true treat- 


TABLE 10.7 PLaNn AND OBSERVATIONS FOR A 3 X 4 RECTANGULAR LATTICE 


(Treatment numbers are enclosed in parentheses) 


Block True block Totals Adjustment 
symbol effect Rep. X B Cx factor 
X4 6 (10) 7 (12) 12 (11) 11 30 —12 —0.6 
X1 2 (2) 9 (8) 4 (1) 16 29 20 +3.4 
X38 7 (7) 16 (9) 15 (8) 23 54 — 5 =1.6 
X2 0 4 0 6 3 @®ll 14 30 +5.4 
127 33 
Rep. Y B Cy 
Y4 9 (3) 11 (6) 20 (9) 17 48 —32 —4.6 
y2 3 (1) 17 (1) 8 (8) 19 44 10 +1.4 
Y3 3 (12) 9 (2)10 (5) 6 25 10 +1.4 
Yl 5 (10) 6 (4) 5 (7) 14 25 0 —0.6 
142 —12 
Rep. Z B Cz 
Zl 4 (8) 20 (6) 15 (12) 10 45 4 +0.2 
22 8 (9) 16 (10) 9 (2) 15 40 —16 —3.8 
Z3 1 ay 6 @) 3 (4 1 10 19 +3.2 
Z4 8 (5) 11 (1) 22) «(7 17 50 —28 —3.8 
145 —21 


Treatment totals 
Hees) Sea oP CON oe oes Oy 10 en 


Unadj. 55 34 18 6 20 46 47 62 48 22 25 31 
Adj. 56 35 20 14 23 47 41 62 38 17 29 32 


ment effect. These figures are shown in the column at the right in table 
10.6. 

No intra-block error has been introduced into the data, which there- 
fore provide two checks on the method of analysis. First, in the analysis 
of variance the intra-block error s.s. should be found to be identically 
zero. Second, the adjusted treatment totals as found from the analysis 
should be exactly equal to the true treatment totals. 

The computing instructions given below apply to either the simple or 
the triple rectangular lattice without repetitions. The changes required 
when there are repetitions are given later. 
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1. Find the block totals, B, the replication totals, the treatment totals, 


and the grand total. 
2. For each block calculate 


C = total (over all replicates) of all treatments in the block — rB 
Thus for block Y4, 
C = 18 + 46 + 48 — (8)(48) = —32 


As usual, the total of the C’s is zero. 

3. Arrange the C values in a supplementary table so that partners ap- 
pear in the same row. The row totals of this table, S, give the sums of 
the C' yalues for each set of partners. The column totals give values 
denoted by Re. 


TABLE 10.8 SuprLeMEnTARY TABLE OF C VALUES 


Block 
symbol Cx Cr Cz S Cx dACy dz pS 
dh 20 0 4 24 4.0 0.0 0.8 0.6 
2 30 10 —16 24 6.0 2.0 —3.2 0.6 
3 1} 10 19 24 —1.0 2.0 3.8 0.6 
4 —12 -—32 -—28 —72 —2,.4 -6.4 —5.6 —1.8 
Totals (Rc) 33-12 —21 0 6.6 -—2.4 —4.2 0.0 


4, The analysis of variance is now obtained. All sums of squares are 
found in the usual way, except that for blocks, adjusted for treatments, 
which is 


eter D Re? SDE 
rk —h- 1) rk+1)Ck-k-1), re—Y(k+ 1k —k — 1) 
= (20)?-+(80)?-++ - « - (28)? ay (33)?+(12)?+-(21)? (24)? ..4-(72)? 


15 60 120 
= 274.0 — 27.9 — 57.6 = 188.5 (10.4) 
TABLE 10.9 ANnatysis oF VARIANCE 
Source of variation af. 8.8. m.s. 
Replications (r — 1) 2 15.5 
Treatments (P +k — 1) i 1067.0 
Blocks rk 9 188.5 20.9E 
Intra-block error (r—)P—-1) —k 13 0.0 0.0E. 
Total (rk? + rk — 1) 35 1271.0 


As anticipated, the intra-block error s.s. in table 10.9 is zero, 


ae 
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5. We now calculate the weighting factors used to obtain the adjusted 
treatment totals. Although a general formula can be given, it is simpler 
to present these separately for simple and triple designs. Two weighting 
factors are necessary.* 


Simple rectangular lattice 


r(By — Ee) 
= (10.5) 
r(k — 1I)Ey + (rh — 2k + r) By 
Ar — E, 
r(By ) (10.6) 
r(k + 1)£y + (rk — 2k — rE, 
Triple rectangular lattice 
r(Ey — Ee) 
= (10.7) 
r(2k — 1)Ey + (rh — 3k + r)Ee 
\r(Ey — Ee 
ve r(By ) (10.8) 


Qr(k + 1), + (rk — 3k — 2r)E. 
In this example Z, is zero and we have 
3 0.2)(3 
ee eae a 0.2)(3)_  O08s 
(3)(5) ()(3)(4) 


6. Complete table 10.8 by adding the columns ACx, ACy, \Cz, and 
wS. The adjustment for any block is 


AC — pS 


where S is taken from the row in which C bi in table 10.8. Thus for 
block X1 the adjustment is 


+4.0 — 0.6 = +34 


oly 


These adjustments are recorded on the plan in table 10.7. Note that 
the order in which blocks appear is different in tables 10.7 and 10.8; care 
must be taken to see that each adjustment is given to the appropriate 
block. 

7. Finally, each treatment total is adjusted by adding the adjustments 
for every block in which the treatment appears. For instance, the 
adjusted total for treatment (4) is’ 


6454-06432 = 14 


* This method of estimating the weights is slightly different from that given by 
Harshbarger. In the interests of simplicity we have used only the pooled mean 
square for blocks and the intra-block error m.s. for estimating \ and y. 
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Every adjusted treatment total will be found to be equal to the true 
treatment total as given in table 10.6. 

Statistical analysis for repetitions of the designs. When the basic design 
with n replications is repeated p times to give r = np replications, the 
changes required in the steps of the analysis are noted below. 

1. No change. 

2a. For any block there are (p — 1) other blocks that have exactly 
the same set of treatments. Arrange the totals for such similar blocks 
in two-way tables, as exemplified below. There are n such tables. 


X Blocks 
Repetition Total 
Block no. Me jecno 5) (2 p) 


(k +1) 


Total 


2b. For each group of similar blocks calculate 

C = total (over all replicates) of all treatments in the group — n(>> 8) 
3. No change. 

4a, The blocks s.s. now has two components. For component (a) ob- 
tain the interaction s.s. for each of the tables in step 2a, and add these 
sums of squares. Since each block total contains k observations, a 
divisor k is required for the analysis of variance. 

4b. The sum of squares for component b is 


Dane Dd Re? ys? 
r(nk—k—-1)  rk+1)(mk-k—-1) r(m—1)(k4 1)(nk —k — 1) 


The separation of degrees of freedom in the analysis of variance is shown 
below. 


df. ms. 
Replications (r — 1) 
‘Treatments (+k —-1) 
Blocks J rk E, 
Component (a) (r —n)k 
Component (b) nk 


Intra-block error (r — 1)(K®7 —1)—k  #, 


Total (ri? + rk — 1) 
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5. The weighting factors in formulae (10.5)—(10.8) are unchanged. 
Note that Z, is the pooled mean square for blocks. 

6. Unchanged. 

7. Unchanged. 

As mentioned previously, in order to have ¢-tests for every pair of 
treatments we would require to present 4 standard error formulae for 
simple rectangular lattices and 7 for triple rectangular lattices. With 
very little inaccuracy these can be reduced to 2, one for the case where 
the two treatments appear in a block and one for the case where this is 
not so. For the difference between two adjusted means the estimated 
error variances are: 

Simple rectangular lattice 

2E. 
r 


Two treatments in the same block: aQ+A-2) 


2E. 
Two treatments not in same block: eee (1 + 2d — p) 
r 


2E.f 2k — 1) ; M—k+1 
Average: * [z 1+r—2) 4 $2 — | 
verge: * “|e paaet* Ot eeeor #) 
2B, 2k — (k2 +k — We 


ae; [1+ @+k—-D | 
Triple rectangular lattice 


2E. 
Two treatments in the same block: = + 2\ — u) 


2H, 
Two treatments not in same block: 5 = (1 +3 — sa 
2E.7 3(k — 1) ee te ( 3 )| 
A 2) SS ox = = (1 +3, —= 
Se [oe se + B) tas py er ay 


Tn the example, where \ = 0.2, » = 0.025, the factors multiplying the 
usual term 2H,/r are 1.375 for two treatments in the same block and 
1.562 for two treatments not in the same block. The factor in the aver- 
age variance is 1.460. 


10.4 Cubic Lattices 


10.41 Description. These designs were produced by Yates for plant- 
breeding work in which selections are to be made from an unusually large 
number of varieties. The number of treatments must be an exact cube. 


* Owing to the fact that only two standard errors were used, this average is not 
quite equal to the average variance taken over all possible pairs, though it is very 
close to that value. 
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The most useful range comprizes 27, 64, 125, 216, 348, 512, 729, and 
1000 treatments. The size of block is the cube root of the number of 
treatments, i.e., 3, 4, 5, 6, 7, 8, 9, and 10, respectively. Thus cubic lat- 
tices can accommodate a large number of treatments in a small size of in- 
complete block. The designs have been used, for example, in an ex- 
periment with 729 strains of ponderosa pine seedlings (10.9) and an ex- 
periment with 729 soybean varieties (10.10). The number of replicates 
must be 3 or some multiple of 3. 

Since the plans are easy to construct and since they occupy a con- 
siderable amount of space for the higher numbers of treatments, they are 
not reproduced here. To obtain a plan, the i? treatments aré numbered 
by means of a three-digit code in which each digit takes all values from 
1tok. For 27 treatments, the code is as follows: 


T* Code T Code T Code 
1 111 4 121 7 131 
2 211 5 221 8 231 
3 3il 6 321 9 331 

10 112 13 122 16 132 

11 2i2 14 222 17 232 

12 312 15 322 18 332 

19 113 22 123 25 133 

20 213 23 223, 26 233 

21 313 24 323 27 333 


* Treatment number. 


The same principle applies with a larger number of treatments. For 
the first & treatments, the last two digits are fixed at (11) while the first 
digit runs from 1 to k. The next k treatments are coded by fixing the 
last two digits at (21) while the first digit again runs from 1 to k, and so 
on in a systematic manner until the final & treatments are reached, for 
which the last two digits have the fixed values (kk). 

Within each of the 3 replications, the k® treatments are grouped into 
k? blocks, each of size k. In the first replication, the rule for this group- 
ing is to keep the last two digits constant within a block, allowing the 
first digit to take all values from 1 tok. Thus, in the example above, the 
9 groups of treatments constitute the 9 blocks, block 1 containing the 
treatments (111), (211), and (311). 

To form blocks in the second replication, we keep the first and last 
digits fixed within any block and give the second digit all values from 
1 to &, With 27 treatments, the first block therefore contains (111), 
(121), and (131), the second block (211), (221), and (231), and the last 
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block (313), (323), and (333). In the third replication, the first and 
second digits are constant within each block. 

The composition of the blocks in the second and third replications is 
shown in table 10,10. 


TABLE 10.10 Smconp AND THIRD REPLICATES OF A CUBIC LATTICE WITH 
27 TREATMENTS 


Rep. II 
Block (1) (2) (3) (4) (5) 
Code T* Code T Code T Code T Code T 


LLY HL 211 2 311. 3 112 10 212 11 
121 4 221 5 321 6 122 13 222 14 
Isl. % 231 8 9 132 16 232 17 


(6) (7) (8) (9) 
Code T Code T Code T Code T 


312 12 113 19 213 20 313° 21 
322 15 123 22 223 23 323 24 
332 18 133 25 233 26 333 27 


Block (1) (2) (3) (4) (5) 
Code 7 Code 7 Code T Code T Code T 


hAGiS aL 211 2 311 3 121. 4 221 5 
112 10 212 11 312 12 122 13 222 14 
113 19 213 20 313 21 123 22 223 23 


(6) 7) (8) (9) 
Code T Code T Code T Code T 


321 6 131 7 231 «8 331 9 
322 15 132 16 232 17 332 18 
323 24 133 25 233 26 333 27 


* Treatment number. 


The original treatment numbers are shown as well as the code num- 
bers. In an experiment it is simplest to record only the code numbers, 
which are required in order to follow the computing instructions. 

Cubie lattices may be expected to be most useful when the number of 
treatments exceeds 100. The designs for k = 3, 4, 5, and 6 lie in the 
range which is also covered by lattice designs. We do not know of any 
investigations of the relative accuracy of lattice and cubic lattice designs 
in these cases. With highly variable experimental material, the use of a 
smaller block might give the cubic lattice some advantage; on the other 
hand the statistical analysis is more laborious owing to the greater num- 
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ber of block adjustments to be calculated. Like the lattice designs, 
cubic lattices cannot be appreciably less accurate than randomized 
" plocks which occupy the same set of replications, and can be analyzed 
by the method for randomized blocks. 
Some cubic lattices can also be arranged in quasi-latin squares (10.11) 
so as to allow the elimination of two sources of error variation. 


10.42 Arrangement of Experimental Material. The most important 
rule is to have units in the same incomplete block as homogeneous as the 
experimental material permits; in field trials the blocks should be com- 
pact in shape. If practicable, blocks within the same replicate should 
also be similar to each other, so as to increase the accuracy of inter-block 
comparisons and to obtain more precise results if the analysis is sub- 
sequently carried out by the method for randomized blocks. 


10.43 Randomization. The steps are: 

1. Randomize the order of the blocks independently within each rep- 
lication. 

2. Randomize the positions of the treatment code numbers inde- 
pendently within each block. 

3. Assign treatments to code numbers at random. 


10.44 Statistical Analysis. The following reference should be con- 
sulted. 


(10.12) Yarns, F. The recovery of inter-block information in variety 
trials arranged in three-dimensional lattices. Ann. Eugen. 9, 
136-156, 1939. 


General formulae are given with a numerical example of the analysis for 
an experiment with 64 treatments. In following this example, the 
reader should note that the experiment was not arranged in separate 
replications; the changes introduced for this reason are pointed out by 
Yates (pp. 147-148). The method of analysis when 6 or 9 replicates are 
used is also indicated (p. 144). Reference (10.13) describes the analysis 
of an experiment with 125 treatments. 


10.45 Error Variances. The formulae for the error variance of the 
difference between two treatment means are summarized below. In the 
analysis as described by Yates (10.12), the symbols 

w—w! w—w! 


wt Qu? Op ew! 
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play a prominent part. It is convenient to write ’ = r/R, wl = w/e. 
The error variances for the different types of treatment comparisons are 
as follows: 


Comparison Variance of difference between means 
2B, 
tn — bu © ft + 2n + 2k — ze’) 
2E. . 
tie — tin = £1 + 4n’ + (8k — 4)u’] 
2B. ‘ 
teog — hun as [1 + 6’ + 3(k — 2)p'] 
2k. is 
A =| oe Ce Wray, mye | 
erage - Woe @+k+D {6n’ + 8% — 1)u’} 


Here 2, is the intra-block error m.s. and r the number of replicates 
(3, 6, or 9). The formulae, though written in a slightly different form, 
are identical with those given by Yates. Our k corresponds to Yates’s 
p, and our r to his n. 


10.1 


10.2 


10.3 


10.4 
10.5 


10.6 


10.7 
lanl 


10.8 
10.9 


10.10 
10.11 


10.12 


10.13 
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PLANS 
Plan 10.1 3 X 3 balanced lattice 
t=9,k =3,r=4,b =12,=1* 
Block Rep. I Rep. II Rep. III Rep"IV 
hy Le 4 147 (Coy slate) (10) 1 8 6 
(2) 4 5 6 (5) 25 8 (8) 7 2 6 (ll) 4 2 9 
3) 7 8 9 (6) 3 6 9 (9) 48 3 (2) 75 8 
Plan 10.2 4 X 4 balanced lattice 
t= 16,k =4,r =5,b = 20,2=1 
Block Rep. I Rep. II Rep. III 
(15) US ee ee (5) 1 5 9 18 KO) e611) 16 
(2) Pyle SCM Retin (6) 2 6 10 14 (10) 5 2 15 12 
(3) 9 10 11 12 (CAMESE Sie L1sei5. (tH); sors" 3. 8 
(4) 18 14 15 16 (8) 4 8 12 16 (12) 18 10 7 4 
Rep. IV Rep. V 
GLO) esel ee TE Ze N12. Dwr 10: 158 
(14) 18 2 11 8 18): OWN 2275 16) 
(15) 5 10 3 16 (19) 18 6 3 12 
(é) 9 6 165 4 (20) 5 144i 4 
Plan 10.3 5 X 5 balanced lattice 
t=25,k =5,r =6,b =30,+ =1 
Block Rep. I Rep. II Rep. III 
jie ian Bie 4: '6 (6) ae Ge 1 1G 2 (De Vat Aas 19" 25 
(2) GT acco 10% 10 ( 2 7 12 17 22 (12) 21 2 8 14 20 
(3) 11 12 13 14 15 (83) 38 8 13 18 23 (13) 16 22 3 9 15 
(4) 16 17 18 19 20 (9) 4 9 14 19 24 (14) 11 17 23 4 10 
(5) 21 22 23 24 25 (10) 5 10 15 20 25 (15) 6 12 18 24 5 
Rep. IV Rep. V Rep. VI 
(16) 1 12 23 9 20 (21) 117 8 24 15 (26) 1 22 18 14 10 
(17) 16 2 13 24 10 (22) 11 2 18 9 25 (27) 6 2 23 19 15 
(18) 6 17 8 14 25 (23) 21 12 3 19 10 (28) 11 7 38 24 20 
(19) 21 7 18 415 (24) 6 22 18 4 20 (29) 16 12 8 4 25 
(20) 11 22 8 19 5 (25) 16 7 23 14 5 (30) 21 17 18 9 5 


*The symbol \ denotes the number of times that two treatments appear in the 
same block. 


Plan 10.4 


Block 
(1) 
(2) 
(3) 
(4) 
(5) 
(6) 
(2) 


(15) 
(16) 
(17) 
(18) 
(19) 
(20) 
(21) 


(29) 
(30) 
(31) 
(82) 
(33) 
(34) 
(35) 


(43) 
(44) 
(45) 
(46) 
(47) 
(48) 
(49) 
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PLANS 
7 X 7 balanced lattice 
1=49,k =7,r =8,b =56,4 =1 

Rep. I Rep. II 
M28) RAS (8) ‘1 8 15 22 29 36 43 
8 9 10 Il 12 13 14 9) 2 9 16 23 30 37 44 
15 16 17 18 19 20 21 (10) 8 10 17 24 31 38 46 
22 23 24 25 26 27 28 (11) 4 11 18 25 32 39 46 
29 30 31 32 33 34 35 (12) 5 12 19 26 33 40 47 
36 37 38 39 40 41 42 (13) 6 13 20 27 34 41 48 
438 44 45 46 47 48 49 (14) 7 +14 21 28 35. 42 49 

Rep. III Rep. IV 
1 9 17 25 33 41 49 (22) 1 387 24 11 47 34 21 
43 2 10 18 26 34 42 (23) 15 2 88 25 12 48 35 
86 44 38 11 19 27 35 (24) 29 16 3 39 26 18 49 
29 37 45 4 12 20 28 (25) 48 80 17 4 40 27 14 
22 30 38 46 5 13 21 (26) 8 44 81 18 5 41 28 
15 23 31 39 47 6 14 (27) 22 9 45 32 19 6 42 
8 16 24 32 40 48 7 (28) 36 23 10 46 33 20 7 

Rep. V Rep. VI 
1 30 10 39 19 48 28 (36) 1 28 45 18 40 13 35 
22 2 381 11 40 20 49 (87) 29 2 24 46 19 41 14 
48 23 3 82 12 41 21 (88) 8 30 3 25 47 20 42 
15 44 24 4 33 13 42 (39) 86 9 81 4 26 48 21 
36 16 45 25 5 34 14 (40) 15 87 10 382 5 27 49 
8 37 17 46 26 6 35 (41) 48 16 38 11 33 6 28 
29 9 38 18 47 27 7 (42) 22 44 17 39 12 34 7 

Rep. VII Rep. VIII 

1 16 81 46 12 27 42 (50) 1 44 88 32 26 20 14 
36 2 17 32 47 13 28 (51) 8 2 45 39 33 27 21 
22 37 3 18 33 48 14 (52) 15 9 3 46 40 34 28 
8 23 388 4 19 34 49 (53) 22 16 10 4 47 41 35 
43 9 24 389 5 20 35 (54) 29 28 17 11 5 48 42 
29 44 10 25 40 6. 21 (55) 36 30 24 18 12 6 49 
15 30 45 i1 26 41 7 (56) 48 37 31 25 19 18 7 
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Plan 10.5 8 X 8 balanced lattice 
t= 64,k =8,r=9,b=72,\=1 
Block Rep. I Rep. IL 
ie oes ey SIS Oe had) (9) 1 9 17 25 33 41 49 57 
(2) 9 10 11 12 138 14 15 16 (10) 2 10 18 26 34 42 50 58 
(8) 17 18 19 20 21 22 23 24 (11) 3 11 19 27 35 43 51 59 
(4) 25 26 27 28 29 30 31 32 (12) 4 12 20 28 36 44 52 60 
(5) 83 84 35 36 37 38 39 40 (13) 5 18 21 29 87 45 53 61 
(6) 41 42 43 44 45 46 47 48 (14) 6 14 22 30 38 46 54 62 
() 49 50 51 52 53 54 55 56 (15) 7 15 23 31 89 47 55 63 
(8) 57 58 59 60 61 62 63 64 (16) 8 16 24 32 40 48 56 64 
Rep. III Rep. IV 
(17) 1 10 19 28 37 46 55 64 (25) 1 18 27 44 13 62 39 56 
(18) 9 2 51 44 61 30 23 40 (26) 17 2 35 60 53 46 31 16 
(19) 17 50 3 36 29 62 15 48 (27) 25 34 8 12 45 54 23 64 
(20) 25 42 35 4 21 14 63 56 (28) 41 58 11 4 29 22 55 40 
(21) 83 58 27 20 5 54 47 16 (29) 9 50 438 28 5 38 638 24 
(22) 41 26 59 12 53 6 39 24 (80) 57 42 51 20 37 6 15 32 
(23) 49 18 11 60 45 38 7 32 (81) 33 26 19 52 61 14 7 48 
(24) 57 84 43 52 13 22 31 8 (82) 49 10 59 36 21 30 47 8 
Rep. V Rep. VI 
(33) 1 26 43 60 21 54 15 40 (41) 1 34 11 20 53 30 63 48 
(34) 25 2 11 52 87 62 47 24 (42) 33 2 59 28 45 22 15 56 
(35) 41 10 3 20 61 38 381 56 (43) 9 58 3 52 21 46 39 32 
(36) 57 50 19 4 45 30 39 16 (44) 17 26 51 4 13 38 47 64 
(37) 17 34 59 44 5 14 55 32 (45) 49 42 19 12 5 62 31 40 
(38) 49 58 35 28 13 6 28 48 (46) 25 18 438 36 61 6 55 16 
(89) 9 42 27 36 538 22 7 64 (47) 57 10 35 44 29 54 7 24 
(40) 33 18 o1 12 29 46 638 8 (48) 41 50 27 60 87 14 23 8 
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Plan 10.5 (Continued) 8 X 8 balanced lattice. 
Rep. VII Rep. VIII 
(49) 1 42 59 52 29 38 23 16 (57) 1 50 35 12 61 22 47 32 
(50) 41 2 19 36 13 54 63 32 (58) 49 2 43 20 29 14 39 64 
(51) 57 18 3 28 53 14 47 40 (59) 38 42 3 60 13 30 55 24 
(52) 49 84 27 4 61 46 15 24 (60) 9 18 59 4 37 54 31 48 
(53) 25 10 51 60 5 22 39 48 (61) 57 26 11 386 5 46 23 56 
(54) 33 50 11 44 21 6 31 64 (62) 17 10 27 52 45 6 63 40 
(55) 17 58 43 12 37 30 7 56 (63) 41 34 51 28 21 62 7 16 
(56) 9 26 35 20 45 62 55 8 (64) 25 58 19 44 53 38 15 8 
Rep. IX 
(65) 1 58 51 86 45 14 31 24 
(66) 57 2 27 12 21 38 55 48 
(67) 49 26 3 44 37 22 63 16 
(68) 33 10 43 4 58 62 23 32 
(69) 41 18 385 52 5 30 15 64 
(70) 9 34 19 60 20 6 47 56 
(71) 2% 50 59 20 13 46 7 40 
(72) 17 42 11 28 61 54 39 8 
Plan 10.6 9 X 9 balanced lattice 
t=81,k =9,r =10,b =90,\=1 
Block grep. I Rep. IT 
@) AEP Enon Oman) (10) 1 10 19 28 37 46 55 64 73 
(2) 10 11 12 13 14 15 16 17 18 (11) «2 11 20 29 38 47 56 65 74 
(3) 19 20 21 22 23 24 25 26 27 (12) 3 12 21 30 39 48 57 66 75 
(4) 28 29 30 81 32 33 34 35 36 (13) 4 13 22 31 40 49 58 67 76 
(5) 37 388 39 40 41 42 43 44 45 (14) 5 14 28 82 41 50 59 68 77 
(6) 46 47 48 49 50 51 52 53 54 (15) 6 15 24 33 42 51 60 69 78 
(7) 55 56 57 58 59 60 61 62 63 (16) 7 16 25 34 43 52 61 70 79 
(8) 64 65 66 67 68 69 70 71 72 (17) 8 17 26 35 44 53 62 71 80 
(9) 73 74 75 76 77 78 79 80 81 (18) 9 18 27 36 45 54 63 72 81 
Rep. III Rep. IV 
(19) 1 20 12 58 77 69 34 53 45 (28) -1 11 21 31 41 51 61 71 81 
(20) 10 2 21 67 59 78 43 35 54 (29) 19 2 12 49 32 42 79 62 72 
(21) 19 11 38 76 68 60 52 44 36 (30) 10 20 3 40 50 33 70 80 63 
(22) 28 47 39 4 23 15 61 80 72 (31) 55 65 75 4 14 24 34 44 54 
(23) 37 29 48 13 5 24 70 62 81 (32) 73 56 66 22 5 15 52 35 45 
(24) 46 38 30 22 14 6 79 71 63 (38) 64 74 57 13 23 6 48 58 36 
(25) 55 74 66 31 50 42 7 26 18 (34) 28 38 48 58 68 78 7 17 27 
(26) 64 56 75 40 32 51 16 8 27 (35) 46 29 39 76 59 69 25 8 18 
(27) 73 65 57 49 41 33 25 17 9 (36) 37 47 30 67 77 60 16 26 9 
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(87) 
(38) 
(39) 
(40) 
(41) 
(42) 
(43) 
(44) 
(45) 


(55) 
(56) 
(57) 
(58) 
(59) 
(60) 
(61) 
(62) 
(63) 


(73) 
(74) 
(75) 
(76) 
(77) 
(78) 
(79) 
(80) 
(81) 
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Plan 10.6 (Continued) 9 X 9 balanced lattice 

Rep. V Rep. VI 
1 29 57 22 50 78 16 44 72 (46) 1 56 30 13 68 42 25 80 54 
55 2 30 76 23 51 70 17 45 (47) 28 2 57 40 14 69 52 26 81 
38 56 3 49 77 24 43 71 18 (48) 55 29 3 67 41 15 79 53 27 
10 38 66 4 32 60 25 53 81 (49) 19 74 48 4 59 83 16 71 45 
64 11 39 58 5 83 79 26 54 (50) 46 20 75 31 5 60 43 17 72 
87 65 12 31 59 6 52 80 27 (51) 78 47 21 58 32 6 70 44 18 
19 47 75 13 41 69 7 35 63 (62) 10 65 39 22 77 51 7 62 36 
73 20 48 67 14 42 61 8 36 (58) 37 11 66 49 23 78 34 8 63 
46 74 21 40 68 15 34 62 9 (54) 64 38 12 76 50 24 61 35 9 

Rep. VII Rep. VIII 
1 47 66 76 14 38 43 62 27 (64) 1 74 89 67 32 24 52 17 63 
64 2 48 81 77 15 25 44 63 (65) 37 2 75 22 68 33 61 53 18 
46 65 3 13 32 78 61 26 45 (66) 78 38 3 31 23 69 16 62 54 
37 56 21 4 50 69 79 17 36 (67) 46 11 57 4 77 42 70 35 27 
19 38 57 67 5 51 34 80 18 (68) 55 47 12 40 5 78 25 71 36 
55 20 30 49 68 6 16 35 SI (69) 10 56 48 76 41 6 34 26 72 
73 11 30 40 59 24 7 53 72 (70) 64 29 21 49 14 60 7 80 45 
98 74 12 22 41 60 70 8 54 (71) 19 65 30 58 50 15 43 8 81 
10 29 75 58 23 42 52 71 9 (72) 28 20 66 13 59 51 79 44 9 

Rep. IX Rep. X 
1 65 48 40 23 60 79 35 18 (82) 1 388 75 49 59 15 70 26 36 
46 2 66 58 41 24 16 80 36 (83) 73 2 39 13 50 60 34 71 27 
64 47 8 22 59 42 34 17 81 (84) 37 74 3 58 14 51 25 35 72 
73 29 12 4 68 51 43 26 63 (85) 64 20 80 4 41 78 52 62 18 
10 74 30 49 5 69 61 44 27 (86) 28 65 21 76 5 42 16 53 63 
28 11 75 67 50 6 25 62 45 (87) 19 29 66 40 77 6 61 17 54 
37 20 57 76 32 15 7 71 54 (88) 46 56 12 67 23 83 7 44 81 
55 88 21 18 77 383 52 8 72 (89) 10 47 57 31 68 24 79 8 45 
19 56 39 31 14 78 70 53 9 (90) 55 11 48 22 32 69 43 80 9 
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Plah 10.7 6 X 6 triple lattice 
Block Rep. I Rep. IT 
(1) Uimaain enone () Hien suU eo mon 
(©) (SOU O mn? (8) 2 8 14 20 26 32 
(3) 13 14 15 16 17 18 @® 3 9 15 21 27 33 
(4) 19 20 21 22 23 24 (10) 4 10 16 22 28 34 
(5) 25 26 27 28 29 30 (1) 6 11 17 23 29 35 
() 31 32 338 34 35 36 (12) 6 12 18 24 30 36 
Rep. IIT 
(13) 1 8 15 22 29 36 
(14) 31.2 9 16 23 30 
(15) 25 32 38 10 17 24 
(16) 19 26 33 4 il 18 
(17) 13 20 27 34 5 12 
(18) 7 14 21 28 35 6 
Plan 10.8 10 X 10 triple lattice 
Block Rep. I Rep. II 
Q@ 1234567 8 9 10 (11) 111 21 3141 516171 81 1 
(2) 11 1213 1415 16 17 18 19 20 (12) 2 12 22 32 42 52 62 72 82 92 
(3) 21 22 23 24 25 26 272829 30 (13) 8 18 23 33 43 53 63 73 83_ 93 
(4) 81 82 83 34 35 36 373839 40 (14) 4 14 24 34 44 54 64 74 84 94 
(5) 41 42 43 44 45 46 47 48.49 50 = (15) _‘5 15 25 35 45 55 65 75 85 95 
(6) Bl 52 53 54 55 56 57 58 59 60 (16) 6 16 26 36 46 56 66 76 86 96 
(7) 61 62 63 64 65 66 67 6869 70 (17) _7 17 27 87 47 57 67 77 87_97 
(8) 71 72 73 7475 76 777879 80 (18) 8 18 28 38 48 58 68 78 88 98 
(9) 81 82 83 84 85 86 87 8889 90 (19) 9 19 29 39 49 59 69 79 89 99 
(10) 91 92 93 94 95 96 97 98 99 100 (20) 10 20 30 40 50 60 70 80 90 100 
Rep. III 
(21) 1 12 23 34 45 56 67 78 89 100 
(22) 91 2 13 24 35 46 57 68 79 90 
(23) 81 92 3 14 25 36 47 58 69 80 
(24) 71 8293 4 15 26 37 48 59 70 
(25) 61 72 83 94 5 16 27 38 49 60 
(26) B1 62 73 8495 6 17 2839 50 
: (27) 41 52 63 74 85 96 7 1829 40 
: (28) 31 42 53 64 75 8697 819 30 
(29) 21 32 43 54 65 76 87 98 9 20 


(30) 11 


22 33 


44 55 66 77 88 99 10 
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Plan 10.9 


Block 


(1) 
(2) 
(3) 
(4) 
(5) 
(6) 
7) 
(8) 
(9) 
(10) 
(1) 
(12) 


(18) 
(14) 
(15) 
(16) 
(17) 
(18) 
(19) 
(20) 
(21) 
(22) 
(23) 
(24) 


(25) 
(26) 
(27) 
(28) 
(29) 
(30) 
(31) 
(82) 
(83) 
(34) 
(35) 
(36) 


LATTICE AND CUBIC LATTICE DESIGNS 


12 X 12 quadruple lattice 


Rep. I 
DoS Abo ert) OS tO, UL 2 
SLA lon wLO eal 7) eS 9 E20 I? Nie3" 24 
25 26 27 28 29 30 31 32 33 34 35 36 
87 38 39 40 41 42 43 44 45 46 47 48 
49 50 51 52 53 54 55 56 57 58 59 60 
61762" (68 64-65 66 - 67 (68 69 70 71 72 
TSS TAS TS DC 28) O79 R80 Si8l- 982) 383) — 84 
8 86 87 88 89 90 91 92 93 94 95 96 
97 98 99 100 101 102 103 104 105 106 107 108 
109 110 111 112 118 114 115 116 117 118 119 120 
121 122 123 124 125 126 127 128 129 180 131 132 
133 134 135 136 137 138 139 140 141 142 143 144 
Rep. IIT 
Lt 13 925 37 49° 61 «7385. 97 «109 121 138 
2 14 26 38 50 62 74 86 98 110 122 134 
3815 27 #39 Sl 63 75 87 99 111 128 135 
4 16 28 40 52 64 76 88 100 112 124 136 
6 1729 41 5865 77 89 «101 «118 125° 137 
6 18 30 42 54 66 78 90 102 114 126 138 
7 19 #31 #48 #55 67 79 91 103 115 127 139 
8 20 32 44 56 68 80 92 104 116 128 140 
9 21 38 45 #57 69 81 93 105 117 129 141 
10 22 34 46 58 70 82 94 106 118 130 142 
ii 2335 47 69 «7183 95 «107 «119 181 148 
12 24 36 48 60 72 84 96 108 120 132 144 
Rep. III 

i a4 278 4000 157 70889 96 101. 114 127° 140 
2 13 28 39 58 69 84 95 102 113 128 139 
8 16 2 38 59 72 81 94 108 116 125 138 
4 15 2 387 #60 71 82 98 104 115 126 137 
5 18 31 44 #49 62 75 88 105 118 131 144 
6 17 32 43 50 G61 76 87 106 117 132 143 
7 2 2 42 51 64 73 86 107 120 129 142 
8 19 30 41 52 63 74 85 108 119 130 141 
9 22 35 48. 53 66 79 92 97 110 123 136 
10 21 36 «647 «#254 «665 «680 «91 «(98 «109 124 135 
11 2% 33 46 55 68 77 90 99 112 121 134 
12 23 34 #45 56 67 78 89 100 111 122 133 
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Plan 10.9 (Continued) 12 X 12 quadruple lattice 
Rep. IV 
(37) 1 24 30 438 53 64 82 95 105 116 122 135 
(38) 2 23 29 #44 #54 63 81 96 106 115 121 136 
(39) 3 22 32 41 55 62 84 93 107 114 124 133 
(40) 4 21 381 42 56 61 83 94 108 113 123 134 
(41) 5 16 34 47 #57 68 (74 87 97 120 126 139 
(42) 6 15 3838 48 58 67 73 88 98 119 125 140 
(48) 7 14 #36 #45 #59 #66 76 85 99 118 128 137 
(44) 8 138 35 46 60 65 75 86 100 117 127 138 
(45) 9 20 2 39 49 72 78 91 101 112 130 143 
(46) 10 19 2 40 50 Vl 77 92 102 111 129 144 
(47) 11 18 28 387 51 70 80 89 103 110 1382 141 
(48) 12 17 27 38 52 69 79 90 104 109 181 142 
Plan 10.10 3 X 4 rectangular lattice 
Block Rep. X Rep. Y Rep. Z 
Dolenime WLe Vil eeaa te vase) At ie Se 
Ae) 4S 15: 6: ¥2; tees a1 Z2 2 9 10 
XS Sheen in eee 23 3 4 il 
X4 10 11 12 Ya 43: 76 9 Z4e Se 
Plan 10.11 4 X 5 rectangular lattice 
Block Rep. X Rep. Y Rep. Z 
1 eS See Yi oe Oe! Az Zi 8 ats 16s) 
x2 6 7 8 os Shae ae Z2 2 9 16 2 
AS) FORO Ee 12 Yor ues 16) lon Lo! 28, 4 17 AaTe: 
X4 18 14 15 16 VA FAT Cin IT. 20 24 1 68 12> 19 
X5 17 18 19 20 Ait wy: ayaa! taal) Zo. (36) ORNS 
Plan 10.12 5 X 6 rectangular lattice 
Block Rep. X Rep. Y Rep. Z 
PGs Wie Ts a Sa} Yl. 6 11 16 21 26 Z1 7 18 19 25 27 
x2 1678-9) 10) Y2 1 12 17 22 27 Z2 5 14 16 23 29 
XS 11 12 13 14 15 Y3 2 7 18 23 28 Z3 1 8 20 21 30 
X4 16 17 18 19 20 Y4 3 8 13 24 29 Z4 2 9 15 22 26 
X5 21 22 23 24 25 Y5 4 9 14 19 30 2 3 10 11 17 28 
X6 26 27 28 29 30 Yé 5 10 15 20 25 Z6 4 6 12 18 24 


x 
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Plan 10.13 6 X 7 rectangular lattice 
Block Rep. X Rep. Y 
Nigel eh si, 4085, 16 Yl 7 18 19 25 31 37 
XO TON Meet, Oe 1OS TL) 12 Y2 1 14 20 2% 32 38 
X83) 18 14015 216) 17 18 ¥3 2 8 21 27 33 39 
X4 19 20 21 22 23 24 Y4 3 9 15 28 34 40 
X5 25 26 27 28 29 30 Y5 4 10 16 22 35 41 
X6 31 32 33 34 35 36 Y6 5 11 17 23 29 42 
X7 37 38 39 40,41 42 Y7 6 12 18 24 30 36 
Rep. Z 


Z1 12 17 22 28 33 38 


Z2 2 18 24 29 385 40 
Z3 4 20 25 36 42 
Z4 6 it 16 27 32 37 
Zot 18 23 34 39 
Z 3 8 14 19 30 41 
27 66 10 15 21 26 31 
Plan 10.14 7 X 8 rectangular lattice 
Block Rep. X Rep. Y 


immo woes. (se 1b). 6) lef YL 
x2) 8 19 10) AT 1218 14 y2 
X3 15 16 17 18 19 20 21 ¥3 
X4 22 23 24 25 26 27 28 y4 
X5 29 80 31 382 33 34 35 Yb 
X6 36 387 38 39 40 41 42 Y6 
X7 48 44 45 46 47 48 49 XC 
X8 50 51 52 53 54 55 56 Y8 


15 22 29 36 43 50 
‘116 23 30 387 4 51 
2 9 2 81 38 45 52 

10 17 32 30 46 53 
54 
12 19 26 33 48 55 
13 20 27 34 41 56 
1421 28 35 42 49 


Nt] oe} ot} B] oo] v0] +] co 
= 
— 
o-) 
nm 
nr 
— 
Sy 


Rep. Z 
Zi 9 17 25 33 41 49 51 
22 7 18 26 34 88 43 53 
23 1 10 27 85 36 47 55 
Zt 2 11 19 29 42 44 56 
Z 3 12 20 28 87 45 50 
Z6 4 13 21 22 30 46 52 
Zi 5 14 15 23 81 39 54 
Z8 6 8 16 24 32 40 48 
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Plan 10.15 8 X 9 rectangular lattice 
Block Rep. X Rep. Y 
Xd CA Ro VSR eee aces Yl 9 17 25 33 41 49 57 65 
XQ £9 10 e112) As 1 Ss ¥2 1 18 26 34 42 50 58 66 
X83 17 18 19 20 21 22 23 24 ¥3) QPL (2735) ABi Hl 259 N67 
X4 25 26 27 28 29 30 31 32 y4 3 11 19 36 44 52 60 68 
X5 33 34 35 36 37 38 39 40 Y5 4 12 20 28 45 53 61 69 
X6 41 42 43 44 45 46 47 48 Ye 5 13 21 29 37 54 62 70 
X7 49 50 51 52 53 54 55 56 ¥7 6 14 22 30 38 46 63 71 
X8 57 58 59 60 61 62 63 64 Ys 7 15 28 31 39 47 55 72 
X9 65 66 67 68 69 70 71 72 Yo 8 16 24 32 40 48 56 64 

Rep. Z 

Zi 16 23 30 37 45 52 59 66 
Z2, 2 17 32 39 46 54 61 68 
ZB 4 11 26 33 48 55 63 70 
Z4 6 18 20 35 42 49 64 72 
Z 8 15 22 29 44 51 58 65 
Zo 1 9 24 31 38 53 60 67 
Z7 8 10 18 25 40 47 62 69 
ZB 5 12 19 27 34 41 56 71 
Zo 7 14 21 28 36 43 50 57 
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Plan 10.16 


Block 
X1 
X2 
X38 
X4 
X5 
X6 
X7 
X8 
x9 
X10 


Zz 
Z2 
23. 
ZA 
25 
Z6 
“7 
28 
“9 
Z10 
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9 X 10 rectangular lattice 
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CHAPTER 11 


BALANCED INCOMPLETE BLOCKS 


11.1 Description 


The balanced lattices described in the last chapter are a particular 
group of a general class of designs known as balanced incomplete blocks. 
All these designs have the property that any pair of treatments appears 
together equally often within some block. Thus in plan 11.1 every pair 
of treatments appears together once in the same block, in plan 11.2 twice, 
and in plan 11.5 four times. This property insures that the same stand- 
ard error may be used for comparing every pair of treatments; it also 
facilitates the statistical analysis, since any treatment total is adjusted 
in a single operation for all the blocks in which the treatment appears. 

The construction of the designs presents interesting mathematical 
problems, (11.1), (11.2), (11.3). Although a design can be found for any 
number of treatments, , and any size of block, k, most of these are of no 
interest for our purpose, since they require too many replications. The 
plans in this chapter have been restricted to those in which the number 
of replicates does not exceed 10. 

The balanced lattices are the particular set of designs for which the 
relation t = k2 holds. They also have the property that the blocks can 
be grouped in separate replications. The majority of the balanced in- 
complete block arrangements do not possess this property, which is pos- 
sible only when ¢ is a multiple of k. A few designs can be sorted into 
groups which comprize two or three replicates each. 

Tn most plans, the block contains six or fewer units. Accordingly, the 
designs are adapted for experiments in which the appropriate size of 
block is small. They have been applied in greenhouse pot experiments, 
where the block is restricted to the width of the bench; in experiments 
on plant virus diseases, where the block consists of a small number of 
leaves on each plant; in experimental cookery where there is a limited 
number of stoves, in tests of mosquito repellents, reference (11.4), where 
the block consists of the two exposed arms of a subject; and in 
nutritional experiments where each child constitutes a block, 3 different 
foods being given during the 3 terms of the school year. 

315 
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Examples from field experimental work occur in blueberry fertilizer 
trials where, owing to the uneven growth of different bushes, only a small 
number of them can be grouped into a homogeneous block. A similar 
consideration may limit the size of block in experiments on the control 
of fruit pests, where the tree constitutes the experimental unit. 

A few types of factorial design—the 5 X 2,5 X 3,7 X 3, and 7 X 4— 
can be arranged in balanced incomplete blocks with the result that main. 
effects and interactions are confounded to the same extent. It happens 
that in these cases there is no suitable method of confounding the inter- 
actions alone. 


11.2 Comparisons with Other Designs 


Much experimental material is of the type exemplified above, in that 
the natural grouping for use as a block accommodates only a small num- 
ber of units. Where the number of treatments exceeds the number of 
units in a block, there are three alternatives to the use of a balanced 
incomplete block design. (1) A common treatment is placed in each 
block, so as to provide a basis for comparison between treatments which 
are in different blocks. (2) The treatments are divided into groups such 
that the number of treatments per group is equal to the number of units 
per block. The groups are compared in a split-plot arrangement (chap- 
ter 7). (3) Complete replicates are assembled by the grouping of units 
from more than one block, in order that a randomized block design may 
be used. The merits of the first two alternatives were discussed in sec- 
tion 9.5. 

The relative efficiency of randomized blocks and balanced incomplete 
blocks has been discussed by Yates (11.5). Balanced incomplete block 
designs which are arranged in replications cannot be appreciably less 
accurate than randomized blocks. With balanced incomplete blocks 
which cannot be arranged in replications, a comparison with randomized 
blocks is not easy except by the use of uniformity data. Certain con- 
clusions can be drawn from the nature of the experimental material. If 
there is some association among the members of different, blocks, it may 
be possible to form replicates which are only slightly less homogeneous 
than the blocks. For example, suppose that in an animal experiment the 

‘litter is the natural block. If the responses of the animals are influenced 
by their genetic constitution and if replications are constructed from 
closely related litters, the experimental error may be only slightly greater 
within a replication than within a block. In this event randomized 
" blocks may be the more accurate layout. 
Sometimes the investigator has no information about any associations 
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between the groups which constitute the blocks; he knows merely that: 
the experimental responses are likely to differ from group to group. In 
these circumstances Yates (11.5) has shown that incomplete blocks (with 
the recovery of inter-block information) are more accurate on the aver- 
age except when the blocks are ineffective. There is the additional ad- 
vantage that the block differences can be measured as a guide to the de- 
sign of future experiments, whereas with replications composed of units 
from several blocks it would be difficult to estimate the block effects. 

It is worth noting the efficiency factor (2) of any balanced incomplete 
block design whose use is under consideration. This factor, which is shown 
as a decimal fraction in table 11.3 (p. 327) and in the plans, is a lower 
limit to the efficiency of balanced incomplete blocks relative to ran- 
domized blocks. With 37 treatments in blocks of 9 units, for instance, 
the efficiency factor is 91%, so that the loss of efficiency could not exceed 
9%. The loss equals 9% only in the rather unlikely situation in which 
the replications of 37 units are as uniform as the blocks of 9 units, yet the 
variation among blocks is large compared to that within blocks. With 
any reasonable prospect that the blocks are more homogeneous than 
the replicates, the incomplete block design is to be preferred in this 
case. On the other hand, with 7 treatments in blocks of 2 units, the 
efficiency factor is only 58%. In this case incomplete blocks will be 
more accurate only if there is a substantial reduction in error variation 
because of the reduction in block size. 


11.3 Arrangement of Experimental Material 


Units within the same incomplete block should be as homogeneous as 
possible. For those designs which can also be grouped in replications, it 
is advisable to place similar blocks, so far as these can be discerned, 
within the same replicate. 

Certain of these designs can be rearranged to form an incomplete latin 
square (chapter 13) in which variation associated with two types of 
grouping can be eliminated from the experimental errors. The experi- 
menter should consider whether advantage can be taken of this double 
control so as to reduce the error. 


11.4 Randomization 
The steps are: 
1. Rearrange the blocks at random. (If the design is arranged in 
complete replications, the blocks are randomized only within each repli- 
cation and the replications are kept separate.) 
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2. Randomize the positions of the treatment numbers within each 
block. 
3. Assign treatments at random to the numbers in the plan. 


11.5 Statistical Analysis 


In order to present the shortest computational methods, it is ad- 
visable to divide the plans into five types, which are shown in the index 
at the end of this chapter. 


11.51 Type I. Designs Arranged in Replications. We will analyze 
an example with 6 treatments in blocks of 2 (plan 11.3). The experiment 
was conducted by Dr. Pauline Paul at Iowa State College (11.6). Its 
object was to compare the effects of length of cold storage on the tender- 
ness and flavor of beef roasts. Six periods of storage (0, 1, 2, 4, 9, and 
18 days) were tested: these are denoted by treatment symbols 1, 2, 3, 
+++, 6, respectively. 

Thirty roasts from the round of an animal were used. Four muscles 
each provided 6 roasts, while 3 muscles each provided 2 roasts. The 
roasts on any muscle group themselves naturally into pairs, since to each 
roast on the left side of an animal there corresponds another roast on 
the right side. From previous experience it was believed that the 2 
roasts in any pair would give closely similar results. Variation among 
different pairs from the same muscle was expected to be somewhat larger, 
and variation among muscles to be still larger. 

These opinions prompted the use of a design in blocks of 2, each block 
comprizing the left and right roasts in a pair. When grouping the blocks 
into replications, it was natural to put roasts from the same muscle into 
the same replicate. With the first 4 muscles a separate replicate could 
be made from each muscle. The remaining replication consisted of the 
3 smaller muscles. 

The plan and the scores for tenderness are given in table 11.1. Scoring 
was done by 4 judges, each marking on a scale from 0 to 10. The scores 
shown are their totals (out of 40), a high score indicating very tender 
beef. 

The following symbols are used: ¢ = 6 = the number of treatments; 
k = 2 = number of units per block; r = 5 = number of replications; 
b = 15 = number of blocks. 

1. Find the block totals, B, the replicate totals, the grand total, G, 
and the treatment totals, 7’, placed below the plan. 
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TABLE 11.1 Scorws FOR TENDERNESS OF BEEF 


Rep. I Rep. IL Rep. IIL 
B B B 
(1) 7 (2) 17 24 (1) 17 (8) 27 «44 (1) 10 (4) 25 35 
(83) 26 (4) 25 51 (2) 28 (5) 27-50 (2) 26 (6) 37 68 
(5) 83 (6) 29 62 (4) 29 (6) 30 59 (3) 24 (5) 26 50 
137 153 148 
Rep. IV Rep. V 
B B 


(1) 25 (5) 40 (65 (1) 11 (6) 27) 38 
(2) 25 (4) 34 59 (2) 24 (8) 21 45 
(8) 34 (6) 32-66 (4) 26 (5) 32 58 


190 141 

Storage Treat- 
time ment Adj. 
(days) no. Te B, Q Ww T+y»W means 
0 1 70 206 —66 19 71.8 14.4 
1 2 5 00=Ci«Ci (1D D4. 11758) 0 52845) 
2 3 132 256 8 17 133.6 26.7 
4 4 139 262 16 15 140.4 28.1 
9 5 158 285 31 —24 155.7 31.1 
18 6 155 288 22 —51 150.2 30.0 

G = 769 1538 0 0 769.0 


2. For each treatment, calculate the following quantities: 
B, = total of all blocks in which the treatment appears 
Q=kT —B,=2T —-B: 
W=(¢—HT —(—)Be+ k—- DE = 47 5B. + G 


The total of the Q’s and the W’s should each be zero. 
3. The analysis of variance is as follows: 


df. 8.8. m.s8. 
Replications ir -) 4 298.5 
Treatments (unadj.) (t— 1) 5 1059.8 
Blocks within reps. (adj.) —7) 10 213.4 21.34Ey 
Intra-block error ( —t—b+1) 10 77.3 7.73E 


Total ( — 1) 29 1649.0 
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The only sum of squares requiring special instructions is that for blocks 
within replications, adjusted for treatment effects. Since we do not 
know any simple way of finding this directly, it is obtained by a round- 
about method. Calculate in the usual way the unadjusted sum of squares 
for blocks within replications: this comes to 753.0. Then compute the 
sum of squares for treatments, adjusted for blocks, which is 


GHD 2AOt no ee 
ktr( — 1) (2)(6)(5)(1) 


The adjusted sum of squares for blocks is then given by the following 
combination of sums of squares 


+ (66)? + (11)? +++ ++ (22)?] = 520.2 


Blocks (unadj.) ++ treatments (adj.) — treatments (unadj.) 

753.0 aa 520.2 = 1059.8 = 213.4 

4. Calculate the weighting factor 

(Ey — E,) 
w= 
rik — 1)E, + kb —r —t+4+ 1)E, 
5(21.34 — 7.73) 
(30) (21.34) + (10) (7.73) 
The adjusted treatment totals are 
T+puWw 
and the means per unit are found on division by r. 
5. The eflective error variance per unit is estimated as 
E{1 + (¢ — k)u] = 7.73[1 + (4)(0.0948)] = 10.66 
We derive t-tests from this figure by the ordinary rules. For instance, 
the standard error of the difference between two adjusted totals is 
V 2r(10.66), or 10.32; for the difference between two adjusted means the 
standard error is ‘V (2)(10.66)/r, or 2.06. 

6. If an F-test is desired, calculate the sum of squares of the adjusted 
treatment totals. This figure, 4718.0, is divided by r(t — 1), or in this 
case 25, in order to find the mean square on a unit basis. The mean 
Square, 188.7, is compared with the effective error m.s., 10.66, which is 
assigned (tr — 1 — 6 + 1), or 10, df. ‘The F-ratio, 188.7/10.66, is highly 


significant. The test is not exact, because it ignores the effects of errors 
in the weighting factor y. 


Jn an experiment of this type, interest centers on the shape of the 
response curve rather than on comparisons between pairs of treatments, 


= 0.09484 
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The adjusted means indicate the common “diminishing returns” curve. 
An exponential of the form y = A — Be~“, where t is the time of stor- 
age, fits the data satisfactorily. A parabolic regression on ¢, which is 
easier to compute, will be found to be inadequate. The conclusion is 
that storage up to about a week increases tenderness. 

7. The gain in precision over randomized blocks is estimated as fol- 
lows. By pooling the sums of squares for blocks within replications and 
intra-block error, we obtain an estimate of the error m.s. that would have 
applied with randomized blocks. This figure, 14.54, is compared with 
the effective error m.s., 10.66. The relative precision is estimated as 
14.54/10.66, or 136%. Since the randomized block design has 20 
error d.f. as against 10 for the design actually used, this estimate should 
be reduced to 126% by the adjustment described in section 2.31. 


11.52 Type Il. Designs Arranged in Groups of Replications. Only 
two of the plans are of this type, plans 11.6 and 11.13, and in each case 
the group consists of two replications. (Plan 11.2 is also in this class, but 
is better analyzed as shown in section 11.55.) The analysis is very sim- 
ilar to that in the previous section. Two changes should be noted. If 
c is the number of groups, so that ¢ = r/2 for the plans presented here, 
the degrees of freedom in the analysis of variance subdivide as shown 
below. 


df. m.8. 
Groups (ce — 1) 
Treatments (unadj.) (i — 1) 
Blocks within groups (adj.) (b —c) Ey 
Intra-block error (i —t—b+1) Ee 
Total ( —1) 


The second change is that the weighting factor « becomes 
Wd (6 — (My — Be) 
Bk —DO— OK + U-HO-—c-t+ IE 


11.53 Type III. Designs Not Arranged in Replications or Groups of 
Replications. Here again the method of section 11.51 may be followed 
except that no replication effects appear. The analysis of variance now 
reads as follows. 


af. m.8. 
Treatments (unadj.) (t— 1) 
Blocks (adj.) (b — 1) Ey 
Intra-block error ( —t-—b +1) Ee 


Total ( -1) 
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As before, the sum of squares for blocks (adjusted) is found by a sub- 
traction process. The weighting factor becomes 


 - 1), - BE) 
i(k — 1) — 1)E, + @ — k)b — HE. 


BX 


Otherwise the method of section 11.51 applies. Of course, the gain in 
accuracy relative to randomized blocks cannot be estimated. A worked 
example with t = 9, k = 4, has been given by Yates (11.5, 11.7). His 
method of calculating the adjusted sum of squares for blocks is slightly 
different from ours, since he divides this sum of squares into two com- 
ponents so that the calculation becomes partially self-checking. Results 
should be identical by either method. 


11.54 Type IV. Experiments with t= 6. These designs, which can- 
not be arranged in replications, are covered by the computing instruc- 
tions given in the previous section. However, the analysis simplifies 
slightly when ¢ = b, because the sum of squares for blocks (adjusted) 
can be found directly. The data in table 11.2 are from an experiment on 
corn hybrids with t = b = 18, r= k = 4. For field experiments a de- 
sign that can be laid out in separate replications is usually preferred. 
In this experiment, conducted in 1943 by the North Carolina Agricultural 
Experiment Station, it was desired to compare 10 hybrids, all potentially 
suitable for commercial use, at a number of places throughout the state, 
and at each place to test in addition 3 standard varieties that. were 
adapted to that place. With 13 treatments no design that can be put into 
replications is available, and since 4 was a convenient number of replica- 
tions, the design with ¢ = 18, r =k = 4 was chosen. Randomized 
blocks might have been equally good. Incidentally, the plan used 
is different from that given in plan 11.22, a different method of con- 
struction having been employed. Both plans have the same structural 
properties. 

1. Find the block totals, B, the treatment totals, 7’, and for each 
treatment find the total B, of all blocks in which the treatment appears. 
Time is saved if 7’ and B; are found simultaneously. The B; values should 
add to k times the grand total G. 

2. For each treatment find 


W = (—)T —¢—1)B.+ & — 0G 


= 9T — 12B, + 3G (in this example) 
The W’s should add to zero. 


11.54 TYPE IV. EXPERIMENTS WITH t =} 323 


TABLE 11.2 Puan AND YIELDS OF CORN (POUNDS PER PLOT) 


Block 
Block totals 
1 (8) 25.3 (6) 19.9 (9) 29.0 (11) 24.6 98.8 
2 (3) 23.0 (4) 19.8 (8) 33.3 (12) 22.7 98.8 
3 (10) 16.2 (1) 19.3 (12) 31.7 (13) 26.6 93.8 
4 (2) 27.3 (5) 27.0 (8) 35.6 (11) 17.4 107.3 
5 (7) 23.4 (8) 30.5 (9) 30.8 (10) 32.4 neha 
6 (4) 30.6 (5) 32.4 (6) 27.2 (10) 32.8 123.0 
id (1) 34.7 (5) 31.1 (9) 25.7 (12) 30.5 122.0 
8 (8) 34.4 (5) 32.4 (7) 33.3 (13) 36.9 137.0 
3) (1) 38.2 (2) 32.9 (3) 37.3 (10) 31.3 139.7 
10 (2) 28.7 (4) 30.7 (9) 26.9 (18) 35.3 121.6 
11 () 36.6 (4) 31.1 (7) 31.1 (11) 28.4 127.2 
12 QQ) 31.8 (6) 33.7 (8) 27.8 (13) 41.1 134.4 
13 (2) 30.3 (6) 31.5 (7) 39.3 (12) 26.7 127.8 
, 1548.5 
No. T Bi Ww T +uW 
1 141.3 523.3 —362.4 136.7 
2 119.2 496.4 —238.5 116.2 
3 120.0 474.3 33.9 120.4 
4 112.2 470.6 8.1 112.3 
5 122.9 489.3 —120.0 121.4 
6 112.3 484.0 —151.8 110.4 
7 127.1 509.1 —319.8 123.0 
8 127.2 457.6 299.1 131.0 
9 112.4 459.5 143.1 114.2 
10 112.7 473.6 — 23.4 112.4 
11 89.7 427.1 327.6 93.9 
12 111.6 442.4 341.1 115.9 
13 139.9 486.8 63.0 140.7 
1548.5 6194.0 0 1548.5 
3. The analysis of variance is: 
d.f. 8.8. m.s. 
Treatments (unadj.) (¢-—1) 12 542.67 
Blocks (adj.) (@ —1) 12 475.27 39.61%, 


Intra-block error ( —2t+1) 27 538.21 19.93H. 


Total ( — 1) 51 1556.15 


The sum of squares for blocks (adjusted) is found directly as the sum 
of squares of the W’s, divided by ir(t — k)(k — 1), or in this case 
(4) (13) (9) (8) = 1404. 
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4, The weighting factor » simplifies to 
Ey, — E, 19.68 
"i(k — DE, (13)(3)(89.61) 


The adjusted treatment totals are the quantities (7 + »W). 
5. As before, the effective error m.s. per unit is taken as 


Efi + (£ — bp] = 19.93[1 + (9)(0.0127)] = 22.2 


Other features of the analysis are the same as in previous sections. Yates 
(11.5) presents an example with ¢ = 21, k = 5. 
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11.55 Type V. Small Experiments. In certain of the smaller de- 
signs, the numbers of degrees of freedom in the mean squares for blocks 
and intra-block error are rather small. Consequently, the estimates of 
the relative weights assigned to inter- and intra-block comparisons are 
poor. In these cases, unless the whole plan has been repeated to secure 
extra replication, it is best to use the original method of analysis that 
Yates developed (11.8). This method adjusts the treatment means for 
differences between the blocks, but makes no use of inter-block informa- 
tion. The analysis is a little simpler than when inter-block information 
is used, 

For purposes of illustration, this analysis will be applied to the ex- 
ample in the previous section, with ¢ = b = 13, r = k = 4, though in 
practice inter-block information should be utilized in an experiment of 
this size. The data are presented in table 11.2. 

1. Find the block totals, B, and the treatment totals, 7. For each 
treatment find the total B; over all blocks in which the treatment appears. 

2. For each treatment compute 


Q=kT —B,=4T —B, 
—1 12 

v=m+ Q=m+ 
tr(k — 1) (13) (4)(3) , 
where m = 1548.5/52 = 29.78 is the mean of the whole experiment. 
The ¢’ values are the adjusted treatment means. The Q values should 
add to zero. 

3. The analysis of variance is shown below. 


=m + 0.0769Q 


df. 8.8. m.s. 
Blocks (unadj.) (6 —1) 12 689.38 
Treatments (adj.) (é — 1) 12 328.55 27.38 


Intra-block error (r —t-—b+1) 27 538.22 19.93E, 


Total ( — 1) 51 1556.15 


Note that the blocks sum of squares is unadjusted. It is found in the 
usual way as the sum of squares of deviations of the B’s, divided by k. 
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This rule applies even when the experiment is arranged in replications 
or groups of replications, though if the experimenter wishes, he may di- 
vide the blocks sum of squares into that due to replications (or groups) 
and that due to blocks within replications. 

The adjusted sum of squares for treatments is 


tL ree 12 2 Se ‘ 
Enh  PeRseO a + 728)" 
= 328.55 


As usual, the intra-block error s.s. is found by subtraction. 

4. For t-tests, the effective error variance is 

ie [ _ tok kt —1) — (19.93)(12)(4) _ 
‘Lae = Dl "ee —- 1) 3)8) 

For the variance of the difference between two adjusted treatment, 
means, we multiply this quantity by 2/r. 

5. The analysis above provides an exact /’-test of the adjusted treat- 
ment means, The F-ratio is 27.38/19.93, with 12 and 27 d.f. 


11.56 Missing Data. For reasons given in section 10.13, it is recom- 
mended that the analysis of incomplete data be carried out by inserting 
for the missing observations values which minimize the intra-block 
error ss. In our notation the formula, first given by Cornish (11.9), is 
ir(k — IB + k(t — 1)Q — (t — 1)Q! 
(k — Dlir(e — 1) — kt — D)] 
where B is the total of the block containing the missing observation, Q is 
the Q value (= kT — B,) for the treatment that contains the missing 
value, and Q’ is the sum of the Q values for all treatments that are in the 
block with the missing value. 

Example 1. Tn the experiment on roast beef in section 11.51, suppose 
that treatment (5) in replication IV is absent. The first: step is to find 
the block totals, the treatment totals, the Br and the Q values as in the 
ordinary analysis. An x should be inserted for the missing observation. 
The data read as follows. ; 


24.5 


oo 


Treatment 
number i B: Q=kT — B: 
1 70 166 +2 -26 —2 
2 115 241 -u 
3 132 256 8 
4 139 262 16 
5 118 +2 245 +2 =—9+2 
6 155 288 22 


Totals 729 +2 1458 + 2a 0 
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Since the block with the missing value contains treatments (1) and (5) 
Q = —26 —9 = —35 
Henee, since B is 25, 
30B +109 —5Q’ 6B+2Q-—Q' 
20 4 
(6)(25) + (2)(—26) — (~35) _ 
4 


On substituting this value in all places where an « occurs in the table 
above, we are ready to proceed with the analysis of variance. 

Example 2. For an example with 2 missing observations, suppose 
that treatments (8) and (13) are missing from block 12 in the corn ex- 
periment in section 11.54. Denote the values to be inserted by x and y, 
respectively. To save space, the preliminary data are presented only 
for the treatments that occur in the same block as the missing observa- 
tions; these are treatments (1), (6), (8), and (13). 


x 


42 


Treatment 
number ah B, Q=47T — B; 
1 141.3 4544 +2+y 110.8 —z —y 
6 112.3 451 +2+y 34.1 —a2—y 
8 99.4 + 2 388.7 +a+y 8.9 +32 —y 


13 98.8 + y 4179+2+y —22.7 — 2 + 3y 


131.1 = Q' 


B=65.5+at+y 


In the 3 replications in which it appears, the average for treatment (13) 


is 33: this will be chosen as the first approximation to y. Then for x we 
have 


B = 65.5 + 33.0 = 98.5; Q = 8.9 — 33 = —24.1; Q’ = 181.1 
Since? = 130k = 7 = 4, 
Mee 156B + 48Q — 12Q’ . 13B + 4Q — Q’ 
324 27 
(18) (98.5) + (4)(—24.1) — (131.1) 
27 


We now substitute « = 39.0 in the preliminary data and obtain a 
second approximation to y. For this we have 


B = 65.5 + 39.0 = 104.5; = —22.7 — 39.0 = —61.7 
Q = 131.1 


39.0 


——————— _— 
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Note that Q’, which does not involve z or y, remains fixed throughout the 
calculation. The second approximation to y is 


(13)(104.5) + (4)(—61.7) — (31.1) | 
27 


The next 2 approximations will be found to be x = 40.1, y = 36.7, 
which may be used as the values to be inserted. 

The rest of the analysis proceeds as usual; the intra-block error de- 
grees of freedom are reduced by 1 for each missing observation. In the 
calculation of standard errors for ¢tests of the adjusted treatment 
means, the method given in section 10.13 should be followed. 


36.3 
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TABLE 11.3 InpEx TO PLANS 


t k ze b At E Plan Type 
4 2 3 6 1 67 ah Vv 
3 3 4 2 .89 Vi 
5 2 4 10 1 «62 11.2 Vv 
3 6 10 3 83 7 It 
4 4 5 3 94 * Vj 
6 2 5 15 1 60 11.3 I 
3 5 10 2 80 11.4 Ill 
3 10 20 4 80 11.5 u, 
4 10 15 6 -90 11.6 IL 
5 5 6 4 -96 34 v 
7 2 6 21 1 58 * Ill 
3 3 7 1 -78 IN Ers Vi 
4 4 v6 2 88 11.8 v 
6 6 if 5 Aye 3 v 
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TABLE 11.3 Inpex 70 pLans (Continued) 


t k ts b AT E Plan Type 
8 2 7 28 1 57 11.9 I 
4 % 14 3 -86 11.10 I 
7 7 8 6 98 = Vv 
9 2 8 36 1 -56 ns Tit 
4 8 18 3 84 11.11 Il 
5 10 18 5 -90 11.12 It 
6 8 12 5 94 11.13 Ir 
8 8 9 7 -98 * IV 
10 2 9 45 1 56 11.14 I 
3 9 30 2 +74 11.15 Til 
4 6 15 2 -83 11.16 Il 
5 2) 18 4 .89 11.17 Il 
6 9 15 5 -93 11.18 Tit 
G) 9 10 8 +99 ss IV 
11 2 10 55 1 55 Til 
5 5 ll 2 -88 1-19 IV 
6 6 ll 3 92 11.20 IV 
10 10 1 9 99 “ IV 
13 3 6 26 1 +12 11.21 Til 
4 4 13 1 81 11.22 IV 
9 9 13 6 96 11.23 IV 
15 3 7 35 1 at 11.24 I 
7 7 15 3 92 11.25 IV 
8 8 15 4 «94 11.26 IV 
16 6 6 16 2 89 11.27 IV 
6 9 24 3 89 11.28 Il 
10 10 16 6 96 11.29 IV 
19 3 9 57 1 -70 11.30 Til 
9 9 19 4 -94 11.31 IV 
10 10 19 5 95 11.32 IV 
21 3 10 70 1 -70 11.33 I 
5 5 21 1 84 11.34 IV 
7 10 30 3 -90 11.35 Til 
25 4 8 50 1 -78 11.36 Il 
9 9 25 3 93 11.37 IV 
28 4 9 63 1 -78 11.38 I 
7 9 36 2 -89 11.39 iit 
31 6 6 31 1 86 11.40 IV 
10 10 31 3 93 11.41 IV 
37 9 9 37 2 91 11.42 IV 
41. 5 10 82 a 82 11.48 Til 
57 8 8 57 1 -89 11.44 IV 
73 9 9 73 il -90 11.45 IV 
91 10 10 91 1 91 11.46 IV 


* These plans are constructed by forming all possible combinations of the ¢ numbers 
{n groups of size k. The number of blocks b serves as a check that no group has been 
missed. 

+ Number of times that two treatments appear together in the same block. 
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PLANS 
Plan 11.1 t=4,k =2,r =3,b =6,\ = 1, E = 0.67, Type V 
Block Rep. I Rep. II Rep. IT 
a 1 2 (3) 1 3 (5) 1 4 
3) ae (4 2. #4 (a) os 
Plan 11.2 t=5,k =2,r =4,b = 10, = 1, E = 0.62, Type V 
Reps. I Reps. III 
Block and IL and IV 
On hi (6) 1 4 
(Qi sn (G) Ohl 
(3) 2 5 (8) 3 5 
(4) 1 3 (9) 1 5 
(5) 4 5 (10) 2 4 

Plani18  t=6,k =2,r=5,b =15, =1, E =0.60, Type! 

Block Rep. I Rep. II Rep. III Rep. IV Rep. V 
(QQ) 1 2 (4) 1 3 (7) 1 4 (10) 1 5 (18) 1 6 
(2) 3 4 (5) 2 5 (8) 2 6 (11) 2 4 (14) 2 3 
(3) 5 6 (6) 4 6 (9) 3 5 (12) 3 6 (15) 4 5 

Plan 11.4 =6,k =3,r =5,b = 10, = 2, H = 0.80, Type II 

Block 
@ 12 5 (6) 2 3 4 
@ 126 () 23 6 
(8) 1-38 4 (8) 2 4 6 
(4) 1 3 6 (9) 3 5-6 
(5) 1 456 (10) 4 5 6 
Plan 11.5 1=6,k =3,r = 10,6 = 20, =4, E = 0.80, Type I 
Block Rep. I Rep. IL Rep. IIT Rep. IV 
(1) = e268 (3) ha? # (5) 1 2 65 () 1 2 6 
(2) 4 5 6 4) 3 5 6 (6) 3 4 6 (8) 3 4 5 
Rep. V Rep. VI Rep. VII Rep. VIII 
(9) 13 4 (Cit) eS ab (1S)2 ees (15) 1 4 5 
(10) Bop (12) 25456 (4) 20405 (16) 2 3 6 
Rep. IX Rep. X 
(7) L4G: (19) 1 5 6 
(18) 2 3 5 (20) 2 3 4 
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Plan ii  t=6,k =4,r=10,b = 15, =6, E = 0.90, Type II 
Block Reps. I and II Reps. III and IV Reps. V and VI 
a) Poe a 2 eh tar Oe ches @ © 2 3) 6 
(2) 1 4 5 6 (5) 1 2 4 6 (8) 1 3 4 5 
@) he sl aes (Cn Seo a @ 2 4 #5 6 
Reps. VII and VII Reps. IX and X 
(10) 1 2 4 5 (18) 1 2 5 6 
ay 1 3 5 6 (14) 1 3 4 6 
(12) 2 3 4 6 (15) 2 3 4 5 
Plan 11.7 t=7,k =3,r =3,b =7,\ =1, E = 0.78, Type V 
Block 
Gy 294 (3) 3 6 () 5 6 1 (70) ve 3D 
QQ) 2 3 5 (4) 4 ai (6) 6 7 2 
Plan ii8 t=%k=4,r=4,b=7,\ =2, E = 0.88, Type V 
Block = 
(Go) ea Tene (4) 12 3 6 (7) 24 5 6 
Q 1467 (ay) See 
(03) Cea eave (6)' 8) ae: 
Plan 11.9 t=8,k =2,r =7, b = 28, \ =1, E = 0.57, Type I 
Block Rep. I Rep. II Rep. III Rep. IV 
@: i 2 ®) 1 3 (9) 1 4 (13) 1 5 
(2) 3 4 (6) 2 8 (10) 2 7 (14) 2 3 
(3) 5 6 Lae e a mmmnG.) 5), 47 
(4) 7 8 (8) 6 7 (12) 5 8 (16) 6 8 
Rep. V Rep. VI Rep. VII 
(tra). ah 6 (21) 1 7 (25) 1 8 
(is) 20 G4 (2) QuemG (8) 20005 
(19) 3 8 (28) 3 5 (27) 3 7 
(20) 5 7 (24) 4 8 (28) 4 6 
Plan 11.10 t=8,k =4,r =7,b = 14, \ =3, E = 0.86, Type I 
Block Rep. eg Rep. II Rep. III Rep. IV 
(Gyo Te253 4 (SB) 2S: () 1368 (HY 1 4 6-7 
@ 5678 @ 3456 (6) 2457 @) 2358 
Rep. V Rep. VI Rep. VIL 
(9) 12 5 6 (GB) bs iss (13) 14 5 8 
(0) 3 47 8 (12) 24 6 8 (144) 2367 
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PLANS 
Plan 11.11 t=9,k =4,r=8,b = 18, =3, E = 0.84, Type III 
Block 
(@. dmeneue ( P1309 (3) 2.5 6 8 
@ 12 5 6 @) ieee a4 3.5 8 9 
@elaans @) 280800 (15) 46 7 9 
@ 1357 (0) 24 5 9 (16) 3 4 5 6 
(6) 14 6 8 (a) 267 9 ay 396.78 
(6) 13 6 9 (2) 23 4 7 (gs) 45 7 8 
Plan 11.12 t=9,k =5,r =10,b =18, =5,2 = 0.90, Type IIT 
Block 
cy we was @i AES DeG |) Cle) un espaibns 
) 23 46 8 @~ 12369 (4) 12467 
(3)'2 4 5 7 8 Q) Densesns ds), omen 
Ca CO) elo A 
(§) 8 405EGne, Gi) Seaeese, | aie 
CORSE (CO) Dear Cb) Thee! 
Plan 11.18 t=9,k =6,r=8,b =12,\ =5, H = 0.94, Type I 
Block Reps. I and IT Reps. III and IV 
(a) 124578 (D. DP QenOn a 
Q@ 23 5689 Giiesme omen 
3) 134679 @ 23467 8 
Reps. V and VI Reps. VII and VIII 
M713 5678 (0) 4567 8 9 
() 124689 (a) 1238 4 5 6 
9) 2345 79 (2) 1 2 8 789 
Plan 11.14 t=10,k =2,r =9, b = 45, =1, E = 0.56, Type I 
Block Rep. I Rep. II Rep. HI Rep. IV Rep. V 
‘ln @ 1 38 @) 1 4 Ge) i 6 @D 1 6 
Q 3 4 @ Qu (a2) eeio” (07) 2 8 (22) 2 9 
a) 5 6 (4 8 (12) 3.7 (18) 3 10 (3) 38 8 
4 7.8 @© 5 9 (4) 5.8 (19) 4 9 (4) 4 10 
6) 9 10 (10) 6 10 (15) 6 9 (20) 6 7 (25) 5 7 
Rep. VI Rep. VII Rep. VIII Rep. IX 
(2G) enna) eS en (00) use pe 1s LEED 
lonenmers (GOA cms) aC) 2ant iG) 2iase 
es) 3.9 (3) 4 6 (38) 3 5 (43) 3 6 
(99) 4.5 (34) 5 10 (39) 6 8 (44) 4 7 
(go) 8 10 (35) 7 9 (0) 7 10 (45) 8 9 


| 
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Plan 11.15 1 =10, k =3,r =9,b = 30, d = 2, B = 0.74, Type Ul 
Block 
(Gh) Seas} M17 9. ds) 259 (9) 385106) | (25), 4 16; 9 
eins (8) 1 810 (14) 2 6 7 (20) 3 710° (6) 4 7 8 
(3), pes 85: @ 1910 (5) 279 @) 3 8 9 (27) 5 610 
4 14 6 (10) 2 3 6 (ie) 2 810 (22) 3 910 (28) 5 7 8 
() 16 7 (i) 2 410 (7 3 4 7 (23) 4 5 9 (29) 6 7 10 
() 16 8 (12) 2 5 8 (18 3 4 8 (24) 4 510. (80) 6 8 9 
Plan 11.16 t=10,k =4,7r =6,b = 15, d = 2, E = 0.83, Type III 
Block 
@ 12, 34 (6) 1 6 8 10 (i) 3 5 °9 10 
(2) tae boee 0 (() ye (12) 36 7 10 
(BY S10 8m 88 (8) 2 4 7 10 (13) 8 45 8 
(4) 14 9 10 (9) 2 5 8 10 (14) 4,5 6 7 
(6) OL 67, 919) (0) 2.7 8 9 (15) 46 8 9 
Plan 11.17 t=10,k =5,r =9,b = 18, \ =4, EB = 0.89, Type II 
Block 
(LY V2 18 BAO: (7 145 610 (3) 2 5 6 8 10 
(2) eZ RSC e 37 (8) 148 9 10 (14) 2 6 7 9 10 
@) 12 4 6 9 (9) 1579 10 (15) 3 4 6 7 10 
(AylueQipar ees (10) 2Se 488\ 030! ~ (10) 84 bo 
® 13868 9 (il) 2359 10 (7) 356 8 9 
(Oe ES eee 10) (12) a 78d. 08) 4 6 8 
Plan 11.18 t=10,k =6,r =9,b = 15, =5, BE = 0.93, Type III 
Block ’ 
@) 12 4656.8 9 (Cyr2 eS SEO Sa 0: | ii esa b win 1810 
(2) be 6 ee 858) 10 (7) 267s 9 210 - (12) 9 1236 7 10 
(3) 2 4 5 6 9 10 (G)iarowo mG sneo (lo) ates be Carr. 78 
4 1246 7 8 (Dy Ls Bess 9) 10: (14), 8 4516. 10 
(onda tices uoest04 VIO) 12d) 4 Soy 9 CO) 8, 458 7 8 
Plan 11.19 t=11,k =5,r =5,b =11,\ = 2, BH = 0.88, Type IV 
Block 
(EI 2 SSA be 28) (7) ALSO A As 
()°2 384 6 9 (8) 8 9 10 1 4 
(3) 3.4 5 7 10 (9), 39! 40) Th 2) 6 
(4) 45 6 8 il (10) 103 The Sly 316 
() 5 SOR ire Or 1T1)) S11 SET OR aN: 
(6) 06. (ae lee 
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Plan 11.20 t=1l1,k =6,r =6,b =11,d =3, E = 0.92, Type IV 
Block 
(D #56 “7 9 100 At (A). TZ aso Gro 
(2) ib ES) Oe ar (8) 2c 36) Fes 
@) Pveoyes sy oma Oy: SAG eS 
(4). SE Ce GO (10) 2h 4aat5y 78s 9 
(B) QES eA aBe 10 ee (1) 3: 6156) 8 9 40 
6 13 45 9 
Plan 11.21 t= 13,k =3,r = 6,6 = 26,4 =1, E = 0.72, Type II 
Block 
(a eae (0) 2 7 10 (9) 4 11 12 
(2) ke Faas (1) 2 8 12 (20) 5 6 8 
2) [Uae (12) 34 8 (21) (572 
@®) Tew (13) 3 5 10 (22) 95 9) th 
(6) 1 10 i (14) 38 6 12 (232) 6 9 18 
(6) 1 12 18 aij) Gy RAS SY (24) 7 8 Il 
() 2 4 =9 (16) 3 11 18 (25) 8 10 138 
(8) 2 5 18 (17) 4 6 10 (26) 9 10 12 
(9) 2 6 it as) 4.7 «+18 
Plan 11.22 {= 13,k =4,r=4,b = 13, =1, Z = 0.81, Type IV ; 
Block 
qd) 1 2 4 10 (is See acliec3 (CL) e Tito ees: 
(2) 2.35 2 (ie AA acs (12) 12 18 2 8 
(3) 3 4 6 12 (), 8 9 11 4 (18) 18) 1 3) 9 
(4) 4 5 7 18 (9) 9 10 12 5 
(by On Ome Sie (0) 10 11 18 6 
Plan 11.28 t =13,k =9,7r =9, b = 13, \ = 6, EF = 0.96, Type IV 
Block 
(1) Bap GiL ae Sa 9) 11 12 18 (Biel) ho eb Ge eda 13 
(2) TRA 68 8: 10 12 13 (Oe TL S2S4, Cnt 8 11 13 
(8)' Lb 7 8° 9 10 11 18 10) 2 Sao es 9 12 
(Ca 10 11 12 (ii) 273400) 68s Ce 10 13 
(Gyo ne eee) li 12 18 (2): P8846. 62 758 1031 
(6) ISSA oes 10 11 12 13 (3) 2 4 5 6 7 8 10 Thee) 
(DY 4b) Oa 8 11 12 138 
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Plan 11.2, = 15,b =3,r=7,b =36,d =1,E =0.71, Type T 
Block Rep. I Rep. II Rep. II Rep. IV 
aby ae Pe (Oj) ps (yp Wenz qe) is) 9 
@) 4 8 12 (7 2 8 10 a2) 2.9 i (7 2 13 15 
(3) 5 10 15 () 3 13 14 (13) 3 12 15 Gs) ey 
(4) 6 il 13 @ 6 9 15 (4) 4 10 14 a9) 5 ll 14 
() 7 9. J4 (10) 7 11 12 (15) 5 8 18 (20) 6 10 12 
Rep. V Rep. VI Rep. VII 
Ql) 1 10 1 (26) 1 12 18 1) 1 14 15 
(22) 2 12 14 Qn 2.5 7 (32) 2 4 6 
3) 35 6 (28) 3 9 10 (es) Ge ehan 
(24) 4 9 18 9) 4 11 15 (34) 5 9 12 
(25) 7. 8 15 (30) 6 8 14 (35) 7 10 18 
Plan 11.95 1=15,k =7,r =7,b = 15,» =3, B = 0.92, Type IV 
Block 
(48), TO) CL (9) 2469 ll 12 14 
(Qyemmom sin 9. 10h at (10) 25 7 8 10 12 14 
©) ia eens ay 2579 ll 13 15 
@ 14.5) 8) 9 14 15 (2) 34 7 8 1 13) 14 
(Ou tes pedo at 12513 (3) 3 4.7 9) 10 12 15 
i(j) Gh Rene Oe (4 3-5) 6 Sella) 15 
@ Lien7 iO il 14 15 (5) 3.5 6 9 10 13 14 
@) 24 6 8 10 13 15 
Plan 11.96 1=15,k =8,r =8,b = 15, =4, E = 0.94, Type IV 
Block 
@ 8 9 10 11 12 13 14 15 ® 135 7 8 10 13 15 
@45 6 71213 4 1 (0) 1346 9 1 18 15 
Q) 45 6 7-8 9 10 11 (1) 13 4 6 8 10 12 14 
(Cy De ee) (2) 125 69 10 12 15 
@ 23 67 8 9 M15 (3) 125 6 8 al ia 14 
6) 23 4 510 1 4 15 (4 1247 9 10 13 14 
() 2) By aw) 8 9) 12) as (5) 1247 8 li 12 15 
(8) Lo 8i Bez) Bom tasers SS ti ee 


PLANS 335 
Plan 11.27 t =16,k =6,r =6,b = 16, \ = 2, HE = 0.89, Type IV 
Block 
() 18 14 15 4 8 12 9 5 7 8 2 10 14 
(Q\yr Témy Seog qo) 1 2 3 8 12 16 
(3) O0M0l 104 8eI6 Q)- 1 3) 4.6 10 (14 
(chy iy el ah Bes 2)\y, 13 RAI San zal 
(Go) OP ae 3))- el 2 TPL, BLS. 
(6) 13 15 16 2 6 10 a4 5 6 7 4 12 16 
@) (9EA0N 1258s 7015. (15) 28 18 4 bn 9 ls 
(8), “LO AT eo Tes bss (16) ‘14° 15 16) Lo e8. 
Plan 11.28 t= 16,k =6,r =9, b = 24, = 3, E = 0.89, Type II 
Block 
(Qi); 225 bs 6 Sis 12: (13)) {2h bere 
(2)2 M2 eee le (4) (25 SOO ee On 
(3) 12 9 10 15 16 (5) 2 4 10 12 18 15 
(4) sabe el0 le (16) 3 4 5 6) 15) 16 
(5), SSA GR eSyets) 1b (L7) FOU See 10) 
@® 13 9 1 14 16 (igs) 3 4 11 12 13 14 
(Bn Teo: BS e10 a qo) 56 6 9 10 13 14 
(Sp ieee Oe ars AS (20) 5979 1 180 1 
(9) 1 4°9 12 14 15 (21) 5 8 9 12 13 16 
(10)s Ziadie once. lb (22) 96: 7 10) a de 
@i1)s 2arS On by ed (23) 6 8 10 12 14 16 
(12)s) 2S LON ETS 16 (24) 7 8 11 12 15 16 
Plan 11.29 t = 16, k = 10,r = 10, b = 16, \ = 6, E = 0.96, Type IV 
Block 
(1), DBS SES OMS 0 e 16 (9) 1346 9 11 12 13 15 16 
(2) 23.4 6 10 11 12 14 15 16 (10) 4 56 6-7 9 10 11 18 14 16 
(je TEPPER ip Gr habs Wee 15 (AD: 2QUBCES 0) 112) SY eG 
(4) 1247 9 10 12 18 14 16 (2) 1245 6 8 9 10 12 15 
(5) 1S Pare 7 18 10Ns, 15 16 (13) 35 6 8 9 10 12 13 14 16 
(6) AGEL tGs Wee oa 2 14 (4) 123 8 9 10 11 13 14 16 
(O) ETA oes Is. 14 16 (5) 16 7 8 10 11 12 14 15 16 
(CHP nee ee 2) 14 15 16 (a6) 2346 7 8 10 il 12 13 
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BALANCED INCOMPLETE BLOCKS 


Plan 11.80 1 =19,k =3,r =9,b =57,\ =1, B = 0.70, Type II 


Block 
(ft). 29 a6) 2 9 it (30) 4 13 14 (44) 8 14 19 
Qnlag sae6 (7) 2 10 18 (31) 5 6 (45) 8 15 18 
(Byo eet ets ag) 3 4 15 (82) 5 % 10 (46) 8 16 17 
@ 15.16 (9 3 5 14 (33) 5 8 9 7) 9 13 19 
(65) ied BEL Sica 2 (20) 3 7 12 (84) 5 12 17 (48) 9 14 18 
(6) 1 8 18 @ly 3) 811 (35) 5 13 18 (49) 10 12 19 
(Li 9a 72 (22) 3 10 138 (88) 78) eS (50) 10 14 17 
(8) 1 10 1 (23) 3 9 16 (37): 6.9)'_ 10 (1) 10 15 16 
(9). 2. 15: 17 (24) 3 18 19 (88) 6 18 15 (62) 11 12 18 
CLO) 2 18) S17 (25) 4 5 19 (39) 6 16 19 (58) “1. 18) 17 
Gi) 2) 16 (26) 4 6 12 (40) 6 17 18 (4) 11 14 16 
(2) 0 25 15 (27) A ad (41) 7 #9 15 (65) 11, 15 19 
(18) 2 6.14 (28) 4 8 10 (42) 7 16 18 (56) 12 13 16 
(14) 2 7 18 (29) 4 9 17 (48) 7 17 19 (57) 12 14 15 
(16) 2 8 12 
Plan 11.81 t=19,k =9,r =9,b =19, =4, E = 0.94, Type IV 
Block 
(PLC QNB WA VbERGT U7 48 59 (Qi) 2 73\ 5! 6 11. 13/916" 17 18 
(2) 12 4 5 10 13 14 17 19 (12) 23 8 9 10 12 17 18 19 
(3) DeQeINS Tl 2. 14 16. 17 (13) 2 4 6 8 10 14 15 16 18 
(4) 126 9 12 13 15 16 19 (14) 25 79 11 14 15 18 19 
(5) 18 4 6 11 12 14 18 19 (15) 34 6 9°12 14°15 16 17 
() 13 7 9 10 13 14 16 18 (16) 3 6 7 8 18 14 15 17 19 
(7) 138 5 8 10 11 15 16 19 (7) 4 5 7 8 12 13 16 18 19 
(8) 1 4 8 9 11 13 15 17 18 (48) 467 9 10 11 16 17 19 
(9) 15 6 7 10 12 15 17 18 (19) 5 6 8 9 10 11 12 13 14 
(10) 23 4 7 10 11 12 13 15 


PLANS 337 
Plan 11.82 t =19,k =10,r = 10, = 19, =5, E = 0.95, Type IV 
Block 


: (1) 10 11 12 18 14 15 16 17 18 19 (il) 14789 10 12 14 15 19 
2) 8 6 7 8 9111215 1618 (12) 1456 7 11 13 14 15 16 

(3) 38 4 5 6 9 10 13 15 18 19 (3) 13579 11 12 18 17 19 
l (4) 8 4 5 7 8 10 11 14 17 18 (14) 13 46 8 10 12 13 16 17 
| (5) 2 5 7 8 9 10 13 15 16 17 (15) 126 7 8 10 11 13 18 19 
| (6) \2 4 5 6 8 Il 12 15 17 19 (6) 12459 10 11 12 16 18 
} () 2 4 6 7 9 12 13 14 17 18 (i7) 1236 9 10 11 14 15 17 
. (8) 2 3 86 7 10 12 14:16 19 (8) 12358 12 13 14 15 18 
(9) 32023) 8 Soe als: 14 16 19 (19) 1234 7 15 16 17 18 19 
(10) 1 5 6 8 9 14 16 17 18 19 

Plan 11.88 t=21,k =3,r =10,b = 70, =1, E = 0.70, Type I 

| Block Rep. I Rep. IL Rep. III Rep. IV Rep. V 
\ QM 1-233 (8) 1 415 (16) 1 617 (22) 169 (20) 1 7 21 


(2) 4 5 6 (9) 2 511 (16) 2 414 (28) 2 716 (30) 218317 
(3) 7 8 9 (10) 3 916 (17) 3 711 (24) 8 821 (81) 3 10 18 
(4) 10 11 12 (11) 61720 (18) 6 10 19 (25) 41719 (82) 4 811 
8 
9 


(5) 1381415 (12) 71219 (19) 1620 (26) 51013 (33) 5 1619 
(6) 16 17 18 (18) 81318 (20) 1518 (27) 111520 (84) 6 12 15 
(7) 19 20 21 (14) 10 14 21 (21) 12 13 21 (28) 12 14 18 (35) 9 14 20 


Rep. VI Rep. VI Rep. VIII Rep. IX Rep. X 
(36) 1 810 (48) 111 18 (50) 11220 (57) 118 19 «(64) «1 «14 16 
(37) 21819 (44) 2 10 20 (51) 2 6 8 (58) 2 9 12 (65) 215 21 
(88) 31517 (45) 3 5 12 (52) 31419 (59) 38 420 (66) 3 618 
(39) 41216 (46) 4 918 (538) 418 21 (60) 5 814 (67) 4 710 
(40) 5 921 (47) 6 16 21 (54) 5 715 (61) 6 718 (68) 5 18 20 
(i) 61114 (48) 71417 (65) 91017 (62) 10 15 16 (69) 8 12 17 
(42) 71320 (49) 8 15 19 (56) 11 13 16 (63) 11 1721 (70) 9 11 19 


Plan 11.84 .1 = 21, k = 5,7 =5,b = 21, = 1, B = 0.84, Type IV 


Block 
(iene eue etal (8) 12 16 18 (15) 5 10 16 20 
(2), ES Omre TST, (9) 11 16 19 (16) 6 9 15 20 


48 3 
1 6 4 
@) SOM 12017 (10) 2.5 12 15 19 a7 1 8 10 15 21 
(4) 138 14 15 16 17 (i) 3 8 9 14 19 (is) 2 °7. 9 16N2t 
(6) 2t 5 9 13.18 (2) 4 7 10 13 19 (ig) 38 6 12 18 21 
(6) 2 6 10 14 18 (13) 1 7 12 14 20 (20) 4 5 ll 14 21 


O77 B70 tip leas (144) 2 8 11 13 20 (21) 17.18 19 20 21 
pe 
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BALANCED INCOMPLETE BLOCKS 


Plan 11.85 t =21,k =7,r =10,b = 30, = 3, E = 0.90, Type II 


Block 


(09) 
(2) 
(3) 
(4) 
(5) 
(6) 
(7) 
(8) 
(9) 
(10) 


(11) ° 


(12) 
(13) 
(14) 
(15) 


Po) PAT] or RY] co] HE] tT] | or] 9] 0 


Plan 11.36 
Block 


(1) 
(2) 
(3) 
(4) 
(5) 
(6) 
(7) 
(8) 
(9) 
(10) 
(11) 
(12) 
(18) 
(14) 
(15) 
(16) 
eu) 


25 


10: TLE Le 
U1) As 


Bl wll ela oloalml Aloo tle al olan 
ro) 
= 
& 
pa 
i 


20 
21 
15 
16 


(16) 
(7) 
1g) 
(19) 
(20) 
(21) 
(22) 
(23) 
(24) 
(25) 
(26) 
(27) 
(28) 
(29) 
(30) 


t=25,k =4,r=8,b = 50, =1, 


(18) 
(19) 
(20) 
(21) 
(22) 
(23) 
(24) 
(25) 
(26) 
2) 
(28) 
(29) 
(30) 
(31) 
(32) 
(33) 
(34) 


CO] AY A] OR] Or] | C9] DOP ST] | or) | 09] tO 


m| | 00] or] BR] oo} role la 


ol) rmle| ao! ox 


eo] oo} tol | a] | orl | 0 


S 


11 


12 


E = 0.78, Type III 


(35) 
(36) 
(87) 
(88) 
(39) 
(40) 
(41) 
(42) 
(43) 
(44) 
(45) 
(46) 
(47) 
(48) 
(49) 
(50) 


3 
17 


16 


19 


PLANS 339 

Plan 11.87 t =25,k =9,r =9,b = 25,  =8, E = 0.93, Type IV 
Block 
(iwKGree ea me (4) 26 7 10 12 18 20 23 25 
(2) 12 3 14 15 18 19 22 23 (15) 26 7 11 13 19 21 22 24 
(3) 12 3 16 17 20 21 24 25 (16) 3 4 8 10 13 18 21 23 24 
(4) 1 47 10 I 14 15 20 21 (7) 3 4 8 11 12 19 20 22 25 
(3) 1 4 7 12 18 16 17 18 19 (18) 3.5 7 10 13 15 16 22 25 
6) 15 9 10 11 18 19 24 25 (9) 3.5 7 11 12 14 17 23 24 
() 1 5 9 12 13 20 21 22 28 (20) 3 6 9 10 13 14 17 19 20 
(8) 16 8 10 ll 16 17 22 23 (21) 3 6 9 11 12 15 16 18 21 
(9) 1 6 8 12 18 14 15 24 25 (22) 45 6 14 16 18 20 22 24 
(10) 2 4 9 10 12 15 17 22 24 (3) 4 5 6 15 17 19 21 23 25 
(i) 24 9 1 13 14 16 23 25 (4) 78 9 14 17 18 21 22 25 
(12) 2.5 8 10 12 14 16 19 21 (25) 7.8 9 15 16 19 20 23 24 
(13) 25 8 11 138 15 17 18 20 
Plan 11.88 t=28,k=4,7r=9,b= 63, \ = 1, E = 0.78, Type I 
Block Rep. I Rep. II Rep. IIT 
(1) 28 1 10 19 (8) 28 2 11 2 (15) 28) aoa" 28 
(2) 2 9 13 16 Oy Fay (16) 4 2 15 18 
(3)~ Bune eters a) ae an oo ent 
(4) 4 7 28 24 (uu) 5 8 24 25 (18) 6 9 25 26 
(6) 5 6 20 27 (cb) a a> Oe) (19) 7 8 221) 20 
(6) 12 179-22 25 (13) 13 18 23 26 (20) 14 10 24 27 
(7) 14 15 21 26 (14) 15 16 22 27 (21) 16 17 23 19 

Rep. IV Rep. V Rep. VI 
(22) 28 4 13 22 (29) 28 5 14 23 (36) 28 6 15 24 
(23) 5 38 16 10 (30) 6 4 17 It (37) 7 5 18 12 
(24) 6 2 14 12 31) 7 3 15 18 3s) 8 4 16 14 
(25) 7 1 26 27 (32) 8 2 27 19 39) 9 3 19 20 
(26) 8 9 28 21 (3) 9 1 24 22 (40) 1 2 25 28 
(27) 15 11 25 19 (34) 16 12 26 20 (41) 17 138 27 21 
(28) 17 18 24 20 (35) 18 10 25 21 (42) 10 11 26 22 

Rep. VIL Rep. VIII Rep. IX 
(48) 28 7 16 25 (50) 28 8 17 26 (57) 28 9 18 27 
(44) 8 6 10 13 6) @ 7 41 a4 (8) 1 8 12 15 
(45) 9 5 17 15 (52) 1 6 18 16 (69), Con ty 0a? 
(46) 1 4 20 21 (58) 2 5 21 22 (60) 8 6 22 28 
(47) 2 8 26 24 (4) 3 4 27 25 61) 4 5 19 26 
(48) 18 14 19 22 (55) 10 15 20 23 (62) 11 16 21 24 
(49) il 12 27 23 (56) 12 13 19 24 (63) 13 14 20 25 
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BALANCED INCOMPLETE BLOCKS 


Plan 11.39 t =28,k =7,r =9,b = 36, d = 2, H = 0.89, Type III 


Bloek 
Q) 4°78 9 144 23 28 (1s) 4 8 il 17 19 21 25 
Q 2 910 T1524 (20) 1 dd 14 18 23 25 26 
@) 6 8 138 15 16 18 21 Ql) 2 4 5 6 16 22 23 
() “7 12 13 17 22 24 25 (22) 8 4 10 11 12 14 18 
(5) 4 10 16 17 20 26 27 (28) 1 9 14 16 17 19 22 
6) 2 11 18 19 22 26 28 (24) 1 2 4 18 20 24 28 
(7) 1 8 6 12 1923 27 (25) 38 6 8 17 28 2A 2% 
@® 2 8 5 14 20 21 25 (26) 5 6 7 10 19 25 28 
(0), saesn0 12946095 @7) Lue 7 8.11 2 26 
(0) 2.3 6 9 ii 18 17 (28) 9 10 18 19 20 21 23 
(il) 4 6 12 18 15 19 26 (22) 2 8 14 15 19 24 27 
(2) 8 7 16 18 1920 24 (80) 3 9 15 16 25 26 28 
(8) 6 10 14 21 22 24 26 (31) 5 8 9 12 18 20 22 
(14) 11 15 20 22 23 25 27 (82) 11 12 16 21 23 24 28 
pen MERISMaIeeD TAS.» (83). sNmsune 7 lb: at 622 
(ie) 2 7 9 12 21 26 27 (34) 5 7 WL 18 14 16 27 
(De eBrsponds1220 2728 (85) 46 9 18 24-25 27 
(18) 6 12 14 15 17 20 28 (36) 2 7 10 15 17 18 23 
Plan 11.40 t=31,k =6,r =6,b =31, =1, B = 0.86, Type IV 
Block 
(1) 1 611162126 (12) 6 223191528 (22) 11 218 9 25 30 
(2) 2 712172226 (13) 11 7 8242028 (28) 21 12 3 19 10 30 
(3) 8 813182326 (14) 1612 8 42528 (24) 6 2213 4 20 30 
(4) 4 91419 2426 (15) 211718 9 528 (25) 16 7 23 14 5 30 
(5) 510 15 20 2526 (16) 11223 92029 (26) 1 7 13 19 25 31 
(6) 123 4 527 (17) 16 213241029 (27) 21 2 8 14 2031 
() 678 91027 (18) 617 3142529 (28) 1622 3 9 15 31 
(8) 111213 141527 (19) 21 718 41529 (29) 11 17 23 4 10 31 
(9) 16 1718 19 2027 (20) 1122 819 529 (80) 61218 24 5 31 


(10) 21 


22 23 24 25 27 


* (21) 117 8 24 15 30. 


CLE 


22 18 14 10 28 


(31) 26 27 28 29 30 31 


PLANS 


Plan 11.41 t = 31, k =10,r = 10, b = 31, \ =3, E = 0.93, Type IV 


341 


Block 

(1) 16 9 12 17 18 24 26 27 29 (17) 5 610 8 18 21 26 28 22 31 
(2) 27 10 13 18 19 25 27 28 29 (18) 6 711 9 19 15 27 22 23 31 
(8) 3111 14 19 20 26 28 22 29 (19) 7 112 10 20 16 28 23 24 31 
(4) 4212 8 20 21 27 22 23 29 (20) 1 213 11 21 17 22 24 25 31 
(5) 5818 9 21 15 28 23 24 29 (21) 2 8 14 12 15 18 23 25 26 31 
(6) 6 4 14 10 15 16 22 24 25 29 (22) 1 2 4 8 911 15 16 18 28 
(7) 75 8 11 16 17 23 25 26 29 (23) 2 8 5 9 10 12 16 17 19 22 
(8) 25 10 11 15 20 24 26 27 30 (24) 3 4 6 10 11 13 17 18 20 23 
(9) 36 11 12 16 21 25 27 28 30 (25) 4 5 71112 14 18 19 21 24 
(10) 4 7 12 13 17 15 26 28 22 30 (26) 5 6 11218 8 19 20 15 25 
(11) 5 118 14 18 16 27 22 23 30 (27) 6 7 21314 9 20 21 16 26 
(12) 6214 8 19 17 28 23 24 30 (28) 7 1 814 8 10 21 15 17 27 
(18) 73 8 9 20 18 22 24 25 30 (292) 1 2 38 4 5 6 7 29 30 31 
(14) 14 9 10 21 19 23 25 26 30 (30) 8 9 10 11 12 13 14 29 80 31 
(15) 34 8 18 16 19 24 26 27 31 (31) 15 16 17 18 19 20 21 29 30 31 
(16) 45 9 14 17 20 25 27 28 31 

Plan 11.42 t =87,k =9,r =9,b = 37, \ = 2, E = 0.91, Type IV 

Block 

(1) 34 18 20 36 3 31 25 33 21 (20) 26 716 1 9 83 10 34 12 
(2) 12 4 86 15 37 13 10 19 29 (21) 34 2018 17 15 5 24 4 9 
(3)) 8 37 382 6 9 15 11 25 33 (22) 30 36 17 29 21 9 27 82 16 
(4) 21 2 5 29 4 28 33 11 7 (23) 8 12 20 18 23 21 27 7 37 
(5) 36 24 2 34 6 23 28 16 37 (24) 11 16 27 22 12 25 24 4 381 
(6) 35 11 18 9 12 36 3 14 28 (25) 32 138 22 715 216 3 18 
(7) 32 12 24 20 35 2 33 30 19 (26) 35 21 19 18 16 6 25 5 10 
(8) 211 8 34 27 35 10 13 17 (27) 24 29 3 25 87 135 7 17 
(9) 19 34°29 11 1 18 23 32 31 (28) 12 22 5 8 34 8 6 29 30 
(10) “15 21 12, 12 6<e1 17 14 (29) 26 35 21 14 82 4 22 37 34 
(11) 17 36 6 22 26 11 7 20 19 (30) 9 14 25 2 10 22 23 20 29 
(12) 24 3 26 11 23 10.30 15 21 (31) 19 16 238 17 33 8 3 4 14 
(18) 37 16 30 1 13 14 11 20 5 (32) 17 32 28 5 18 12 26 25 23 
(14) 3 4 6 20 10 32 27 28 1 (88) 23 36 27 35 15 5 22 1 33 
(15) 35 13 9 23 31 30 7 6 4 (34) 36 18 25 4 8 30 26 2 1 
(16) 14 27 18 24 6 29 26 13 33 (35) 8 24 10 32 7 31 14 & 36 
(17) 8 13 24 9 21 19 28 1 22 (36) 5 26 3 19 27 2 31 37 9 
(18) 37 30 31 22 10 28 33 17 18 (87) 28 19 25 30 34 15 7 14 27 
(19) 20 29 15 8 26 35 16 28 31 


Plan 11.48 


Block 
(1) 
(2) 
(3) 
(4) 
(5) 
(6) 
(7) 
(8) 
(9) 

(10) 
(11) 
(12) 
(18) 
(14) 
(15) 
(16) 
(17) 
(18) 
(19) 
(20) 
(21) 
(22) 
(23) 
(24) 
(25) 
(26) 
(27) 
(28) 
(29) 
(30) 
(31) 
(82) 
(38) 
(84) 
(35) 
(36) 
(87) 
(38) 
(39) 
(40) 
(41) 


SO] 00] NT] OD] or} B] eo] tO] 


36 
37 


41 


ele 
FR} SO] 0] co] a] oo] or] B] oo] Do] 


we] ey ee] 
or| | oo! to 


16 


30 
31 
32 
33 
34 
35 
36 


29 


41 


Oo} St] >] or} HH] co] to] 


) 
10 


29 


41 


| Co} a] @] er] A] co} wo] 


e 
oS 


11 
12 
13 


10 
i 
12 


41 


1 C0} tl ol or} BI co! nol 
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4l 


| st} oe] or} Bm] co] rope 


we) oo} pope 


t=41,k =5,r = 10. b = 82, =1, E = 0.82, Type III 


21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 


©| 00] a] oor} mw} co} ro} a] Ee 


rlelel eel el eel el ele 
S| oO] Oo} | S] ar) A} oo) mw] Al S 


39 
40 


wlalelel mle! rol a| = 


ford red ed red rnd ee 
al plolmlelolo 


PLANS 343 
Plan 11.44 t =57,k =8,r =8,b =57,d = 1, E = 0.89, Type IV 


Block 

(1) 1 8 15 22 29 36 43 50 (30) 22 2 81 11 40 20 49 54 
(2) 2 9 16 23 30 37 44 50 (31) 43 28 3 32 12 41 21 54 
(3) 3 10 17 24 31 38 45 50 (82) 15 44 24 4 33 13 42 54 
(4) 4 11 18 25 32 39 46 50 (33) 36 16 45 25 5 34 14 54 
(5) 5 12 19 26 33 40 47 50 (34) 8 87 17 46 26 6 35 54 
(6) 6 13 20 27 34 41 48 50 (35) 29 9 88 18 47 27 7 54 
(7) 7 14 21 28 35 42 49 50 (36) 1 23 45 18 40 18 35 55 
(8) 1 SB A 3b) 86) Ge sol (37) 29 2 24 46 19 41 14 55 
(9) 8) 9) 109 12 12 13) 445E (38) 8 30 3 25 47 20 42 55 
(10) 15 16 17 18 19 20 21 51 (39) 36 9 31 4 26 48 21 55 
(11) 22 23 24 25 26 27 28 51 (40) 15 37 10 32 5 27 49 55 
(12) 29 380 31 32 33 34 35 51 (41) 43 16 388 11 33 6 28 55 
(13) 86 37 38 39 40 41 42 51 (42) 22 44 17 39 12 34 7 55 
(14) 48 44 45 46 47 48 49 51 (43) 1 16 31 46 12 27 42 56 
(15) 1 44 38 32 26 20 14 52 (44) 36 2 17 382 47 18 28 56 
(16) 8 2 45 39 33 27 21 52 (45) 22 37 3 18 33 48 14 56 
(17) 15 9 8 46 40 34 28 52 (46) 8 23 88 4 19 34 49 56 
(48) 22 16 10 4 47 41 35 52 (47) 43 9 24 39 5 20 35 56 
(19) 29 23 17 11 5 48 42 52 (48) 29 44 10 25 40 6 21:56 
(20) 36 30 24 18 12 6 49 52 (49) 15 30 45 11 26 41 7 56 
(21) 43 37 31 25 19 13 7 52 (50) 1 9 17 25 33 41 49 57 
(22) 1 87 24 11 47 34 21 53 (51) 43 2 10 18 26 84 42 57 
(23) 15 2 388 25 12 48 35 53 (52) 36 44 3 11 19 27 35 57 
(24) 29 16 3 39 26 13 49 53 (58) 29 37 45 4 12 20 28 57 
(25) 43 80 17 4 40 27 14 53 (54) 22 30 38 46 5 13 21 57 
(26) 8 44 31 18 5 41 28 58 (55) 15 23 31 39 47 6 14 57 
(27) 22 9 45 32 19 6 42 53 (56) 8 16 24 32 40 48 7 57 
(28) 36 23 10 46 33 20 7 53 (57) 50 51 52 53 54 55 56 57 


(29) 1 30 10 39 19 48 28 54 


344 BALANCED INCOMPLETE BLOCKS 

Plan 11.45 t=78,k =9,r =9,b = 73,  =1, E = 0.90, Type IV 

Block 

(1) 1 9 17 25 33 41 49 57 65 (38) 49 58 35 28 13 6 23 48 69 
(2) 2 10 18 26 34 42 50 58 65 (89) 9 42 27 36 53 22 7 64 69 
(8) 3 11 19 27 35 43 51 59 65 (40) 33 18 51 12 29 46 63 8 69 
(4) 4 12 20 28 36 44 52 60 65 (41) 1 34 11 20 53 30 63 48 70 
(5) 5 13 21 29 37 45 53 61 65 (42) 33 2 59 28 45 22 15 56 70 
(6) 6 14 22 30 38 46 54 62 65 (43) 9 58 3 52 21 46 39 32 70 
(7) 7 15 23 81 89 47 55 63 65 (44) 17 26 51 4 13 38 47 64 70 
(8) 8 16 24 32 40 48 56 64 65 (45) 49 42 19 12 5 62 31 40 70 
(2) ta tee 2) ine 55. 165 7) 28566 (46) 25 18 43 36 61 6 55 16 70 
(10) 9 10 11 12 18 14 15 16 66 (47) 57 10 35 44 29 54 7 24 70 
(11) 17 18 19 20 21 22 23 24 66 (48) 41 50 27 60 37 14 23 8 70 
(12) 25 26 27 28 29 30 31 32 66 (49) 1 42 59 52 29 38 23 16 71 
(18) 33 34 35 36 37 38 39 40 66 (50) 41 2 19 36 13 54 63 32 71 
(14) 41 42 43 44 45 46 47 48 66 (51) 57 18 8 28 53 14 47 40 71 
(15) 49 50 51 52 53 54 55 56 66 (52) 49 34 27 4 61 46 15 24 71 
(16) 57 58 59 60 61 62 63 64 66 (53) 25 10 51 60 5 22 39 48 71 
(17) 1 10 19 28 37 46 55 64 67 (54) 33 50 11 44 21 6 31 64 71 
(18) 9 2 51 44 61 30 23 40 67 (55) 17 58 43 12 87 30 7 56 71 
(19) 17 50 3 36 29 62 15 48 67 (56) 9 26 35 20 45 62 55 8 71 
(20) 25 42 35 4 21 14 63 56 67 (57) 1 50 85 12 61 22 47 32 72 
(21) 33 58 27 20 5 54 47 16 67 (58) 49 2 43 20 29 14 39 64 72 
(22) 41 26 59 12 53 6 39 24 67 (59) 33 42 3 60 13 30 55 24 72 
(23) 49 18 11 60 45 38 7 32 67 (60) 9 18 59 4 37 54 31 48 72 
(24) 57 84 43 52 13 22 31 8 67 (61) 57 26 11 386 5 46 23 56 72 
(25) 1 18 27 44 18 62 39 56 68 (62) 17 10 27 52 45 6 63 40 72 
(26) 17 2 35 60 53 46 31 16 68 (63) 41 34 51 28 21 62 7 16 72 
(27) 25 34 8 12 45 54 23 64 68 (64) 25 58 19 44 53 38 15 8 72 
(28) 41 58 11 4 29 22 55 40 68 (65) 1 58 51 36 45 14 31 24 73 
(29) 9 50 48 28 5 38 63 24 68 (66) 57 2 27 12 21 38 55 48 73 
(80) 57 42 51 20 37 6 15 32 68 (67) 49 26 8 44 37 22 63 16 73 
(31) 33 26 19 52 61 14 7 48 68 (68) 33 10 43 4 53 62 23 32 73 
(32) 49 10 59 36 21 30 47 8 68 (69) 41 18 35 52 5 30 15 64 73 
(38) 1 26 48 60 21 54 15 40 69 (70) 9 34 19 60 29 6 47 56 73 
(834) 25 2 11 52 37 62 47 24 69 (71) 25 50 59 20 13 46 7 40 73 
(35) 41 10 3 20 61 88 31 56 69 (72) 17 42 11 28 61 54 39 8 73 
(36) 57 50 19 4 45 30 39 16 69 (73) 65 66 67 68 69 70 71 72 73 
(387) 17 34 59 44 5 14 55 32 69 
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Plan 11.46 t = 91, k =10,r = 10,6 = 91, =1, B = 0.91, Type IV 


Block 
(1) 
(2) 
(3) 
(4) 
(5) 
(6) 
(7) 
(8) 
(9) 

(10) 

(11) 

(12) 

(18) 

(14) 

(15) 

(16) 

(17) 

(18) 

(19) 

(20) 

(21) 

(22) 

(28) 

(24) 

(25) 

(26) 

(27) 

(28) 

(29) 

(30) 

(31) 

(82) 

(83) 

(34) 

(35) 

(36) 

(37) 

(38) 

(39) 

(40) 

(41) 

(42) 

(43) 

(44) 

(45) 

(46) 


1 10 19 28374655 647382 (47) 28 2 57 40 14 69 52 26 81 87 
211 20 29 38 47 56 65 7482 (48) 5529 3 67 41 15 79 53 27 87 
3 12 21 3039485766 75 82 (49) 19 7448 4 59 33 16 71 45 87 
4 13 22 31 40 49 58 67 76 82 (50) 46 2075 31 5 60 43 17 72 87 
B14 23 82 41 50 59 68 77 82 (51) 73 47 21 58 382 6 70 44 18 87 
6 15 24 33 42 5160697882 (52) 10 65 39 2277 51 7 62 36 87 
7 16 25 84 4352617079 82 (53) 37 11 66 49 23 78 84 8 63 87 
817 26 35 44.53 6271 8082 (64) 6438 12 76 50 24 G1 35 9 87 
9 18 27 36 45 54.6372 81 82 (55) 1 47 66 76 14 33 43 62 27 88 
123456 7.8 983 (56) 64 248 31 77 15 25 44 63 88 
f0 1112 13 141516171883 (57) 4665 3 13 32 78 61 26 45 88 
19 20 21 22 23.24.25 2627 83 (58) 37.5621 4 50 69 79 17 36 88 
58 29 30 31 32 33 84.35 36.83 (59) 19 38 57 67 5 51 34 80 18 88 
37 38 39 40 41 42 43 4445 83 (60) 55 20 89 49 68 6 16 35 81 88 
46 47 48 49 50 5152535483 (61) 73 11 30 40 59 24 7 58 72 88 
55 56 57 58 59 60 G1 626383 (62) 28 74 12 22 41 60 70 8 54 88 
6465 66 67 68 69 70 717283 (63) 10 29 75 58 23 42 52 71 9 88 
73.74.75 76 77 7879 8081 83 (64) 1 74 39 67 32 24 52 17 63 89 
120 12 58 77 69 84.53 45.84 (65) 37 2 75 22 68 33 61 58 18 89 
0 2 21 67 59 78 43 85 54.84 (66) 73 88 3 31 23 69 16 62 54 89 
19 11 3 76 68 60 62 44.36 84 (67) 4611 57 4 77 42 70 35 27 89 
584739 423 15 61807284 (68) 55 47 12 40 5 78 25 71 36 80 
37.20 48 135 24706281 84 (69) 10 56 48 76 41 6 34 26 72 80 
46 38 3022 14 6797163 84 (70) 64 29 21 49 14 60 7 80 45 89 
55 74 66 315042 7261884 (71) 19 65 30 58 50 15 43 8 81 89 
6456 75 40 32 5116 82784 (72) 28 20 66 13 59 51 79 44 9 80 
73 65 67 49 41.33.2517 984 (73) _1 65 48 40 23 60 79 35 18 90 
Ti1 2131 41 51 617181 85 (74) 46 2 66 58 41 24 16 80 36 90 
9 2 12 49 32 42 79 6272.85 (75) 64 47 3 22 50 42 34 17 81 90 
10 20340 503370 80 63.85 (76) 73 29 12 4 68 51 43 26 63 90 
5565 75 41404 34 4454.85 (77) ‘(10 74 30 49 5 69 61 44 27 00 
7356 66 22.515 52354585 (78) 28 11 75 67 50 6 25 62 45 90 
BE 7457 13 236 43.53.3685 (79) 37 20 57 76 3215 7 71 54 90 
5B 9g 48 58 6878 7172785 (80) 55 38 21 13 77 33 52 8 72 90 
7629 39 76 5969 25 81885 (81) 19 56 39 31 14 78 70 53 _9 90 
37 47 30 67 77 60 16.26 985 (82) 1 38 75 49 59 15 70 26 36 O1 
129 57 2250 78 16 447286 (83) 73 2 39 13 50 60 34 71 27 OL 
BB 230 76 2351701745 86 (84) 37 74_3 58 14 51 25 35 72 OL 
3856 3497724437118 86 (85) 64 20 30 _4 41 78 52 62 18 OL 
10 38 66 432 6025538186 (86) 28 65 21 76 5 42 16 53 63 91 
841130 58 53379 265486 (87) 19 20 66 4077 6 61 17 54 M1 
37 G5 12 31.59 652.8027 86 (88) 46 56 12 67 23 33 _7 44 81 OL 
1947 75 13 41.09 735.6386 (89) 10 47 57 31 68 24 79 8 45 91 
7350 48 67 14 42 61 830.86 (90) 55 11 48 22 32 69 43 80_9 OI 
ie 74 01 40 68 1534.62 986 (91) 82 83 84 85 86 87 88 89 90 91 


1 56 30 18 68 42 25 80 54 87 


CHAPTER 12 


LATTICE SQUARES 


12.1 Description 


12.11 Balanced Lattice Squares. The number of treatments must be 
an exact square. Within each replicate, the k? treatments are arranged 
on the plan in a k X k square. The method of grouping into rows and 
columns, which varies in successive replications, is such that the treat- 
ment means can be adjusted for differences among the rows and columns 
of each square. Thus, in addition to the elimination of differences among 
replicates from the experimental errors, the design permits a “double 
control” within each replicate, similar to that obtained in a latin square. 

The principle which governs the grouping into rows and columns may 
be seen from the plans 12.1-12.8. With 9 treatments, for instance, treat- 
ment | occurs in the same row as treatments 2, 3, 6, and 8, and in the 
same column as treatments 4, 5, 7, and 9. Thus every other treatment 
appears with treatment 1 either in the same row or in the same column. 
More generally, any pair of treatments occurs once in the same row or in 
the same column. 

This property holds for all plans having an odd number of treatments, 
i.e., for 9, 25, 49, 81, 121, and 169 treatments. The number of replicates, 
for k? treatments, is (k + 1)/2. The standard error of the difference is 
not exactly the same for all pairs of treatments, though the variation in 
accuracy is small. Separate formulae are given for the two standard 
errors for cases in which they may be needed. 

A design with twice the basic number of replicates (e.g., 25 treatments 
in 6 replicates) is obtained by repeating the plan, the rows being inter- 
changed with the columns. In designs so produced, every pair of treat- 
ments occurs together once in the same row and once in the same column. 
On account of this property, the standard error of the difference is the 
same for all pairs of treatments, whether the row and column differences 
are large or small. 

Within the most useful range, the only even numbers of treatments 
which provide lattice square designs are 16 and 64. These designs have, 
respectively, 5 and 9, i.e. (& + 1) replicates. Every pair of treatments 
occurs once both in the same row and in the same column. 

The available selection of designs (up to 12 replicates) is summarized 
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in table 12.1. By analogy with balanced lattices, the designs may be 
called balanced lattice squares. 


TABLE 12.1 AvalLaBLE NUMBERS OF TREATMENTS AND REPLICATIONS FOR 
BALANCED LATTICE SQUARES 


Number of treatments 9 16 25 49 64 81 121 169 
Number of replicates 4,8 5,10 3,6 48 9 5,10 6, 12 ¢ 


For 16 treatments in 10 replicates, the basic plan is used twice. Al- 
though the basic plan (plan 12.1) requires only 2 replicates, the design 
for 9 treatments is not recommended with less than 4 replicates. Even 
in this case there are only 8 df. for estimating row and column variances 
and only 8 d.f. for error. Little would be gained over randomized blocks 
unless the extra controls were highly effective. A design with 9 treat- 
ments and 8 replicates is found by repeating the design for 9 treatments 
and 4 replicates. 

Certain factorial experiments can be arranged in balanced lattice 
squares; e.g., 3 %3,8X2,4%4,4xX2X2,24,5X5,7X7,8 X 8, 
43, 28,9 x 9,34. All main effects and interactions are confounded to the 
same extent with rows and columns. 


12.12 Partially Balanced Squares. Fora balanced design, the availa- 
ble numbers of replications are rather severely restricted. Although de- 
signs are possible for other numbers of replicates, they lack the symmetry 
of the designs described above, with the consequence that the statistical 
analysis usually becomes more complicated. It happens, however, that 
if the number of replications is less than that required for a balanced de- 
sign, the statistical analysis follows the same procedure as for balanced 
designs, apart from minor changes in some formulae. In experimenta- 
tion where double grouping has proved effective but where the replica- 
tions are not sufficient for a balanced lattice square, the additional de- 
signs shown in table 12.2 may be useful. 


TABLE 12.2 OTHER LATTICE SQUARE DESIGNS 


169 


Number of treatments 49 64 81 121 
3, 4, 5, 6 


Number of replicates 3 3,4 3, 4 3, 4,5 


When the number of treatments js odd, the plans for these designs are 
obtained simply by taking the desired number of replicates from the 
plans for the balanced designs. Thus, for 49 treatments in threefold 
replication, we use squares I to III from plan 12.4. With 64 treatments, 
squares I, III, and V are used for 3 replicates, and squares I, TI, V, and 


VII for 4 replicates. ; 
Partially balanced lattice squares are analogous to the triple, quad- 


ruple, and quintuple lattices. 
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12.13 Arrangement of Experimental Material. The method of ar- 
ranging the experimental material is similar to that for an ordinary latin 
square. With k? treatments, the units are arranged in k X k squares so 
that the rows and columns of each square correspond to the two types of 
variation whose effects we wish to eliminate from the errors. In field ex- 
periments, the plots of each replicate are usually laid out in square forma- 
tion, in which case row and column differences represent fertility vari- 
ations in two directions at right angles to each other. If the width of a 
greenhouse bench accommodates 5 pots, 25 treatments can be laid out in 
replicates which consist of 5 rows of 5 pots each. 


12.14 Randomization. Within each replicate the rows and columns 
of the basic plan should be permuted separately at random before apply- 
ing the treatments. It is also advisable to assign the treatments at ran- 
dom to the treatment numbers in the plan. 


12.2 Statistical Analysis 


12.21 Designs with (z + 1)/2, or Fewer, Replications. The same in- 
structions apply to the partially balanced designs and to the balanced 
designs with (& + 1)/2 replications. Table 12.3 shows the plan and 
yields for a lattice square with 25 corn hybrids in 3 replications, con- 
ducted by the North Carolina Experiment Station in 1942, Each plot 
contained 4 rows, with 10 hills per row. 

The computations proceed as follows. The number of treatments is 
k?, and the number of replications r. 

1, Find the row, column, treatment, replication, and grand totals, 
The treatment totals are placed in a square under the plan. 

2. For each row, calculate the L value, where 


L = total (from all replications) of all treatments included in the 
row — r times row total 
eg., 


LT, = 89.8 + 93.3 + 82.1 + 88.9 + 91.1 — 3(159.1) = —32.1 
As a check, the total of the L values over a replication equals 


(Grand total) — (replication total) 
Thus, 


—76.2 = 2189.4 — 3(755.2) 
Similarly for each column we obtain M values, 


M = total (from all replications) of all treatments included in the 
column — r times column total 


TABLE 12.3 Yrsups (pouNDS CORN PER PLOT) FOR A 5 X 5 LATTICE SQUARD IN 


3 REPLICATIONS 


‘Treatment numbers are shown in parentheses 


Square I Total L 5 
(18) 33.8 (9) 30.7 (11) 35.4 (2) 30.1 (25) 29.6 159.1 —82.1 —2.58 
(24) 24.6 (15) 30.8 (17) 28.8 (8) 348 (1) 32.5 151.5 —10.5 —0.84 
(12) 28.5 (8) 24.0 (10) 28.4 (21) 25.0 (19) 35.1 | 141.0 492.4 441.80 
(6) 26.7 (22) 27.2 (A) 25.6 (20) 25.0 (13) 29.4 133.9 411.0 +0.88 
(5) 40.1 (16) 35.7 (23) 30.1 (14) 30.3 (7) 33.5 169.7 —67.0 —5.39 
Total 153.2 148.4 148.3 145.2 160.1 | 755.2 —6.13 
M —16.0 +3.8 —12.8 —10.0 —41.2 —76.2 
e — 0.83 +0.20 — 066 — 052 — 2.13 | —3.94 
Square IT 
(20) 30.9 (17) 33.3 (19) 38.8 (16) 27.7 (18) 34.4 | 165.1 — 45.9 — 3.69 
(18) 37.2 (12) 31.2 (14) 27.9 (11) 27.3 (13) 21.6 | 145.2 — 5.8 — 0.47 
(25) 82.7 (22) 43.0 (24) 28.5 (21) 24.7 (28) 22.7 151.6 — 46.2 — 3.71 
(5) 32.0 (2) 328 (4) 318 (1) 28.7 (8) 32.3 157.6 — 338 — 2.72 
(10) 39.8 (7) 87.3 (9) 31.9 (6) 340 (8) 34.3 177.8 — 69.3 — 5.57 
Total 172.6 177.6 158.9 142.4 145.3 | 796.8 —16.16 
M —62.2 —85.5 —30.1 —12.6 —10.6 —201.0 
2 Sg00) TAS eel. 0.65 — 0.55 | —10.41 
Square IT 
(19) 28.7 (15) 26.3 (23) 21.7 (6) 21.9 (2) 26.0 124.6 + 69.1 + 5.56 
(11) 19.4 (7) 17.8 (20) 16.9 (3) 22.6 (24) 24.2 100.4 + 98.0 + 7.88 
(22) 18.3 (18) 22.1 (1) 17.5 (14) 25.0 (10) 26.9 109.8 +105.9 + 8.51 
(5) 30.2 (21) 27.5 (9) 30.7 (17) 28.1 (13) 27.6 1441 + 9.3 + 0.75 
(8) 34.4 (4) 32.8 (12) 31.9 (25) 28.8 (16) 30.6 1685 — 51 — 041 
Total 131.0 126.0 118.7 126.4 135.3 637.4 4-22.29 
M +86.0 +61.6 +54.8 +46.8 +28.0 +277.2 
@ +445 +319 + 284 + 2.42 + 1.45 | +1435 
Grand total | 2189.4 0 0 
Treatment totals 
(1) 78.7 (2) 88.9 (3) 78.9 (4) 90.2 (6) 102.3 
(6) 82.6 (7) 88.1 (8) 103.5 (9) 93.3 (10) 95.1 
(11) 82.1 (12) 91.6 (13) 78.6 (14) 83.2 (15) 94.3 
(16) 94.0 (17) 90.2 (18) 89.8 (19) 102.6 (20) 72.8 
(21) 77.2 (22) 88.5 (23) 74.5 (24) 77.3 (25) 91.1 2189.4 
Adjusted treatment totals 
(1) 83.7 (2) 85.7 (3) 87.9 (4) 88.9 (5) 95.3 
(6) 84.4 (7) 81.6 (8) 100.1 (9) 87.4 (10) 97.4 
(11) 90.1 (12) 90.1 (13) 78.5 (14) 86.2 (15) 98.7 
(16) 85.5 (17) 83.8 (18) 93.8 (19) 107.0 (20) 77.0 
(21) 78.1 (22) 94.4 (23) 72.6 (24) 79.7 (25) 81.5 
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3. We now compute the analysis of variance. The total s.s. and the 
sums of squares for replications and treatments are found in the usual 
way. If the symbol L, denotes a replication total of the L’s, the sum of 
squares for rows within replications, adjusted for treatments, is 


ae LL 
Ir? — 1) rr — 1) 
(32.1)? + (10.5)? ++--+ (5.1)? i (76.2)? + (201.0)? + (277.2)? 
30 150 
= 1405.95 — $20.32 = 585.63 


The sum of squares for columns, eliminating treatments, is 
(16.0)? + (8.8)? +++++ (28.0)? (76.2)? + (201.0)? + (277.2)? 
30 150 


= 1058.53 — 820.32 = 238.21 
The complete analysis of variance is shown in table 12.4. 


TABLE 12.4 ANauysis oF VARIANCE FoR A 5 X 5 LATTICE SQUARE 


dif. 8.8. m.s. 
Replications (r—-) 2 546.88. 273.44 
Treatments (i? — 1) 24 611.09 25.46 
Rows (adj.) r(k — 1) 12 585.63 48.80E,, 
Columns (adj.) r(k — 1) 12 238.21 19.85E, 
Error (k—A(rk-—r—k—1) 24 229.79 9.57E, 
Total (rk? — 1) 74 2211.60 


4. This step leads to the adjusted treatment means. Let 
E, = mean square for rows 
H, = mean square for columns 
HE, = mean square for error 


ie (Z,—E.) 48.80 — 9.57 
k(r — 1)E, — (5)(2)(48.80) 


,_ Ee EH.) 19.85 — 9.57 
kK —1)E, (5) (2)(19.85) 


If Z, or Z, is less than E,, then \’ or x’, as the case may be, is taken as 0. 


= 0.0804 


= 0.0518 
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5. Multiply the L’s by )’ to give 6 values, and the M’s by uy’ to give 
« values, as presented in table 12.3. 

6. The adjusted total for any treatment is secured by adding to the 
unadjusted total the 6 and ¢ values for each row and column in which 
the treatment appears. For treatment (1), we have 


78.7 + (—0.84) + (—2.13) + (—2.72) + 
(—0.65) + (8.51) + (2.84) = 83.7 


To obtain the treatment means (not shown) we divide by the number 
of replicates. 
7. The average variance of the difference between two adjusted treat~ 
ment means is 
2. 


, 


rk 
(k + 1) 


2(9.57) (3) (5) 
- [: shar ae (0.822) | 


= 8.49; se. = 2.91 


[: + ow +2)| = 


Except perhaps with the 5 X 5 and smaller squares, it is sufficiently ac- 
curate to use this figure for comparisons between any pair of treatments. 
More accurately, for two treatments that appear in the same row, the 
variance of the difference is : 


2E. 
(L+ @ — IN + rp] = 8.40; se. = 2.90 
2 


For two treatments in the same column. 
2E 


r 


° tt +r + (& — Ip'] = 8.58; se. = 2.93 


Clearly the average error is good enough in this experiment. For the 
partially balanced designs one more formula is needed, since some pairs 
of treatments do not oceur together either in a row or inacolumn. The 
variance of the difference is 


e 


- (+r + #)I 
8. As is typical of these designs, the analysis does not provide an exact 


F-test. The treatments m.s. in the analysis (table 12.4) cannot be com- 


pared with the error m.s., since the former has not been adjusted for row 


and column effects. For an approximate F-test, compute the sum of 
squares of deviations of the adjusted treatment totals in table 12.3. This 
comes to 1606.05. Division by 3, since there are 3 replicates, gives 
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535.35, with a mean square of 22.31. This may be compared with the 
effective error m.s., which is 


w, [1+ OR eae | = 987[1 += @1s2) | = 12.73 
: k+1 Bs lee Cas : 


The F-ratio is 22.31/12.78, or 1.75, with 24 and 24 df. 

9. The gain in precision relative to randomized blocks is estimated as 
follows. If the experiment were analyzed as randomized blocks, rows 
and columns would be amalgamated with error, giving 


df. 8.8. ms. 
Replications 2 546.88 
Treatments 24 611.09 
Error 48 1053.63 21.95 


This error, 21.95, is compared with the effective error m.s., 12.73. The 
relative information is estimated as 21.95/12.73, or 172%. Thus 3 
replicates of the lattice square appear about as precise as 5 with ran- 
domized blocks. 

Often a comparison with a lattice design will be of more interest. This 
experiment could have been planned as a triple lattice, and in that event 
the rows would probably have been chosen as blocks. The intra-block 
error would be derived from the pooled sum of squares for columns and 
error in table 12.4. The mean square would be 13.00, with 36 d.f. Con- 
sequently, for the triple lattice we have H, = 48.80 (m.s. for rows), H,. = 
13.00. The effective error m.s. (section 10.28) is 


rh fe r(E»y — E,) 
z.[14+ | -#,|1 ORE aa 


3(48.80 — 13.00) 
(6) (2) (48.80) 


The relative accuracy of the lattice square to the triple lattice is 
estimated as 15.38/12.73, or 121%. ‘ 

An account of the theory and a worked example are given by Yates 
(12.1) for balanced squares and by Cochran ( 12.2) for partially balanced 
squares. 


13.00 [i + | = 15.38 


12.22 Designs with (k + 1) Replications. In this case the analysis is 
slightly different, since it is possible to apply a single adjustment for all 
the rows in which a treatment lies, instead of making a separate adjust- 
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TABLE 12.5 PrrcenTAGES OF SQUARES ATTACKED BY BOLL WEEVILS FOR A 
4X 4 LaTrice SQUARE IN (k + 1) REPLICATIONS 


Boll weevil infestation 
Row 
Square I totals 


(10) 9.0 (12) 20.3 (9) 17.7 (11) 26.3 73.3 
(2) 4.7 (4) 9.0 @) 73 (3) 8.8 29.3 
(14) 9.0 (16) 6.7 (13) 11.7 (15) 4.3 31.7 
(6) 4.0 (8) 5.0 (5) 5.7 (7) 14.3 29.0 


Column totals 26.7 41.0 42.4 53.2 | 163.3 


Square IT 


(5) 19.0 (12) 8&7 (15) 13.0 (2) 15.7 56.4 
(10) 12.0 (7) 6.0 (4) 15.3 (18) 12.0 45.3 
(16) 12.7 (Ql) 63 (6) 17 (11) 13.0 33.7 

(3) 3.7 (14) 3.7 (9) 8.0 (8) 13.3 28.7 


Column totals 47.4 24.7 38.0 54.0 | 164.1 


Square III 


(10) 17.0 (15) 7.0 (8) 10.3 (1) 1.8 35.6 

(9) 11.38 (16) 12.3 (7) 3.0 (2) 5.3 31.9 
(12) 12.3 (13) 8.7 (6) 80 (8) 9.3 38.3 
(11) 30.8 (14) 22.3 (5) 11.0 (4) 12.7 76.3 


Column totals 70.9 50.3 32.3 28.6 | 182.1 


Square IV 


(ie) 50 (12) 103 (8) 57 (4) 127 | 33.7 
Ql) 27 (18) 67 (@) 103 (%) 57 | 25.4 
@) 10 @) 103 (9) 113 (13) 1L7 | 348 
(@ 11.0 () 19.0 (14) 20.7 (10) 29.7 | 80.4 


Column totals 19.7 46.3 48.0 59.8 | 173.8 


Square V 


(8). 2.0 (16) 5.0 (5) 4.0 (10) 13.7 24.7 
6) 9.3 Q) 17 (4) 6.3 (15) 12.8 29.6 
(12) 16.7 () 438 (14) 18.7 Q) 8&7 48.4 
(18) 16.7 (2) 30.0 (11) 25.7 (8) 14.0 86.4 


Column totals 44.7 41.0 54.7 48.7 | 189.1 
872.4 


354 LATTICE SQUARES 12.22 


ment for every row. On the whole, however, the analysis is more com- 
plex than with (k + 1)/2 replications. 

The example is a 4 X 4 lattice square conducted by the U. 8. Bureau 
of Entomology and Plant Quarantine at Florence, $. C., and described 
by Wadley (12.3). The treatments were 16 arsenical insecticides applied 
to cotton with a hand dusting machine. Plots were 10 rows wide and 
70 feet long, being about 14g acre. To allow for border effects, records 
were taken only from the 4 center rows. The data in table 12.5 are the 
percentages of squares * showing attack by boll weevils. These figures 
were obtained by examining 100 squares per plot, 25 from each of the 
4 center rows. Such counts were made at intervals during the summer: 
the data are averages from 3 counts made in August. 

The simplest computational routine, devised by Yates (12.1), will be 
presented here, though it is not too well adapted for making clear the 
meaning of the various steps. Yates’s paper should be consulted for an 
account of the theory. 


TABLE 12.6 OnriGIvaL AND ADJUSTED TREATMENT TOTALS 


Treat- Adj. 

ment treat- 
totals, ment Adj. 
T Re Ct D L! J K M’ totals means 
1 24.6 181.3 164.1 17.2 64.3 81.5 133.1 150.3 32.24 6.45 
2 74.7 284.4 196.6 87.8 —250.8  —163.0 100.4 188.2 68.41 13.68 
3 33.6 146.4 2219 —75.5 274.8 199.3 — 27.2 102.7 43.64 8.73 
4 56.0 214.2 222.1 — 7.9 25.4 175 — 62 = 141 56.79 11.36 
5 60.0 220.7 2231 -24 -— 311 — 33.5 — 40.7 — 43.1 47.20 9.44 
6 34.0 211.0 161.4 49.6 — 46.6 3.0 151.8 201.4 37.89 7.58 
7 33.3 180.0 211.0 —31.0 105.6 746 — 184 — 49.4 36.85 7.37 
8 483 2134 2240 -106 — 14 -— 120 -— 438 — 544 46.58 9.32 
9 50.0 197.8 240.3 42.5 83.4 40.9 — 86.6 -—129.1 50.07 10.01 
10 814 259.3 253.5 58 — 985 -— 927 -— 753 — 69.5 74.57 14.91 
1 98.0 295,1 252.5 426 —2111 —1685 — 40.7 1,9 87.95 17.59 
12 68.3 250.1 227.6 225 —1049 -— 824 — 14.9 7.6 63.51 12.70 
13 60,8 236.0 2512 —15.2 — 644 -— 796 125.2 —140.4 53.45 10.69 
14 TA4 265.5 204.4 61.1 —157.5 — 96.4 86.9 148.0 71.36 14.27 
15 43.3 178.7 236.5 —57.8 152.1 943 — 79.1 —136.9 46.42 9.28 
16 41.7 155.7 199.4 43,7 260.7 217.0 85.9 42.2 55.46 11,09 
Totals 872.4 3489.6 3489.6 0 0 0 0 i) 872.39 174.47 


1. Find the row, column, replication, and grand totals, all shown in 
table 12.5. 
2. Find the treatment totals 7’ (table 12.6). For each treatment find 


F; = total of all rows in which the treatment appears 
C; = total of all columns in which the treatment appears 


*A “square” is the name given to the young flower bud. 
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It may save time to find all 3 quantities, 7, Ry, and C;, by simultaneous 
addition, since this means that the positions of a treatment in the dif- 
ferent replicates are found only once. This can be done if the machine 
carriage is moved so as to accommodate 3 sets of running totals. The 
sum of the R; values and the sum of the C; values should each equal k 
times the sum of the 7’ values. 

Thereafter the successive columns in table 12.6 are filled out as follows. 


Die hy Cs 
UV=kT-—(k+)DR+¢4 
Je Dit Li 


K=J+(k—D 
M=D+K 


Note that the total of each of these quantities is zero. 

3. Compute the analysis of variance. The only sums of squares re- 
quiring special instructions are those for rows and columns. After ad- 
justment for treatment effects, it happens unfortunately that the sums 
of squares for rows and columns are not mutually orthogonal. By a 
well-known result in the analysis of variance, their combined sum of 
squares may be expressed in either of two ways: 


Rows (adj. for treatments) + columns (adj. for treatments and rows) 
= columns (adj. for treatments) + rows (adj. for treatments and 


columns) 


Both expressions are computed. 


Sum of squares for rows adjusted for treatments 
n2 2 2 ari 2 
2 S(L’y & (64.3)? + (250.8)? ++ +++ (260.7) = 1093.02 
Be +1) 320 


Sum of squares for rows adjusted for treatments and columns 


2 Bees I 
S07?) _ (81.5)? + (163.0) +--+ 217.0" _ 1996.76 


“Be 1D 192 
A say? 
Sum of squares for columns adjusted for treatments = Pe +D = 625.85 
Sum of squares for columns adjusted for treatments and rows 
SCR) _ 559.50 


~ 78k — 1) 


356 LATTICE SQUARES 12.22 
Note that 1093.02 + 559.59 = 1026.76 + 625.85. 


TABLE 12.7 ANALYSIS OF VARIANCE 


df. 8.8. m.5. 
Replications k 4 31.56 7.89 
Treatments P-1 15 1244.20 82.95 
Rows (adj. for treatments) 1 15 1093.02 
Rows (adj. for tr. and col.) R-1 15 1026.76 68.452, 
Columns (adj. for tr.) RP-1 15 625.85 
Columns (adj. for tr. and rows) P-1 15 559.59 37.31E. 
Error (? — )(k — 2) 30 680.17 22.67H, 
Total B+hR—-1 79 3608.54 


In finding the error s.s. by subtraction, note that we subtract the com- 
bined effect of rows and columns only once. 
4, This leads to the adjusted treatment totals. Compute 


q = HH, — B,? = (16) (68.45) (37.31) — (22.67)? = 40,348 
Q = (k — 1g = 121,044 
_ &, — B)(kE. — E,) _ (45.78) (126.57) _ 


V s = 0.04787 
Q 121,044 
E, — E.)(kE, — B.) (14.64) (251.13 
fie ( ) ) we )¢ ) = 0.03037 
Q 121,044 


The adjusted treatment totals are 
T+NL' + p'M’ 
and are inserted on the right of the M’ column in table 12.6. For the 
means, divide by r. 
5. The error variance of the difference between two adjusted means is 


2H, 2(22.67 
nn [L + kA’ + p’)] = o— {1 + 4(0.07824)] = 11.9 


To obtain an approximate F-test of the treatments, compute the sum 
of squares of deviations of the adjusted treatment totals, and divide by 
(k + 1)(&? — 1) to give the mean square on a single unit basis. This is 
tested against the effective error m.s., H,[1 + k(\’ + y’)]. 

The gain in precision relative to a randomized blocks or lattice design 
is estimated by the procedure for the lattice square with (k + 1)/2 
replications. 


ly 
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12.23 Designs with 2(k +1) Replications. This case is unlikely to 
be of much practical interest except possibly with the 3 x 3 design in 
8 replicates and the 4 X 4in 10. The analysis is similar to that for (k + 1) 
replications; the changes will be noted very briefly. 

The partition of degrees of freedom is as follows. 


df. 

Replications (2k + 1) 
Treatments (k2 — 1) 
Rows 

Component (a) (k? — 1) 

Component (0) ( — 1) 
Columns 

Component (a) (k? — 1) 

Component (b) (2 — 1) 
Error (2k — 3)(k? — 1) 

Total 2h(k +1) — 1 


Corresponding to any row, there is another row with the same set of 
treatments. Find the differences D, between the totals of corresponding 
rows. The sum of squares of deviations of the D, from their replication 
means, divided by 2k, gives component (a) of the rows s.s. Component 
(b) corresponds to the component that was obtained with (k& + 1) rep- 
licates. As before, it has two forms, an L’ andaJ form. Both are found 
in the same way as with (& + 1) replications, except that the divisors 
must be doubled. The sums of squares for columns are derived similarly. 

The formulae for the adjustment factors are a little more complicated. 
Let H, be the mean square found by pooling component (a) and the J 
component of the rows 8.5. with an analogous definition for EZ. Cal- 


culate 
1 (Oh = a)une 2 (2k) 
‘Mee: We oun, — 2B,’ ° 2kE, — Be 
rf. (W; — W,) Wi — We 
z 5 ane 
KW, + We + (& — 1)Wi) KW, + We — (k — 1)Wid 


The remainder of the analysis proceeds without change. 


12.24 Missing Data. The procedure for analyzing lattice squares 
when certain observations are missing has been worked out by Cornish 
(12.4). As in the case of lattices (section 10.13), the method requires 
two different estimates of each missing observation, one being that found 
by minimizing the jntra-row-and-column error, the other being that ap- 


propriate to the case where the data are analyzed as a randomized block 
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experiment. In order to reduce the amount of computation, we present 
a cruder method which should be satisfactory if only a small fraction of 
the total observations is missing. 

In this approach we estimate each missing observation by the first: 
method mentioned above, that is, by minimizing the intra-row-and- 
column error s.s. Thereafter the analysis proceeds as usual: 1 df. is 
deleted from the total and intra-row-and-column error s.s. for each miss- 
ing value. Special rules for ¢-tests are given later. 

The following totals are used in the formulae for inserting estimates 
in place of the missing values. R, C, 7, and P are the totals of the row, 
column, treatment, and replication, respectively, that contain the miss- 
ing observation, while G is the grand total. Further, 


Sz = total of all other rows and columns in which the treatment with 
the missing value appears (note that this sum does not in- 
clude the row and column which contain the missing value) 

T = total (from all replicates) of all other treatments which appear 
in the row or column that contains the missing value 

Z=kR, — Py + kCy — Po 


With (& + 1) replicates, there is one other row with exactly the same 
treatments as C. Let R, be its total and P, the total of the replicate 
containing R,, with similar definitions for Cy and P:. With (k + 1)/2 
replicates, Z is not used. 

Experiment with (k + 1)/2 replications 
Nex k(k — 1)(R + C) — (k+3)P + 2k(k — 2)T + 6G — 2hS, — 2hT, 

(k — 1)°(k — 8) 
Experiment with (k + 1) replications 


ye kk —W(R+C)—(h+)P+ kk —2)7 +34 —kS, — kT, +2 
(k — 1)?(k — 2) 
Experiment with less than (k + 1)/2 replications. Because of the lack 
of symmetry in this design, an additional symbol is needed. 


U, = total (from all replicates) of all treatments that appear in the 


same row or column as the treatment that has the missing 
value. 


The distinction between U, and T,, should be realized, T. is a total 
over only those other treatments that are in the actual row or column 
that contains the missing value, while U; is a total over all other treat- 
ments that appear somewhere in the experiment in the same row or 
column as the treatment that has the missing value. Thus 7’, is a part 
of U,. It may help to note that 7, is a total over 2(k — 1) treatments, 


4 
| 
| 
| 
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or 2r(k — 1) observations, where r is the number of replications, while 
U; is a total over 2r(k — 1) treatments, or 2r?(k — 1) observations. 
[ere —1)\(R+0C)-rr7+ DP 4+ kk — 2)(r — a 
| +4@+ 0 — krS, — brT, + kU, 
(k — 1) — I)(kr —k — r — 1) 

Example. In the corn experiment in table 12.3, suppose that treat- 
ment (18) in replication I is missing. Here k = 5, r = (k + 1)/2 = 3. 
Omitting the observation 33.3 for this treatment, we find 

R=1258; C=119.9; R+C = 245.7; P = 721.9 
T = 56.5; G = 2156.1 

S, is the total of the rows and columns in which treatment (18) appears 

in squares II and III. Thus 

Se = 165.1 + 145.3 + 109.8 + 126.0 = 546.2 
Finally, 7, is the total for treatments (2), (9), (1 1), and (25), which ap- 
pear in the row that has the missing value, plus that for treatments (5), 
(6), (12), and (24) which appear in the column with the missing value. 
These totals are to be taken from the table of treatment totals. This 
gives 7, = 709.2. Then 


(20) (245.7) — (8)(721.9) + (80) (56.5) 
+ (6)(2156.1) — (10) (546.2) — (10)(709.2) 
w 32 38.0 


The exact formulae for tests are complicated, and it appears that any 
good approximate rule would also be rather complicated. The following 
rule is suggested, though it somewhat underestimates the standard 
errors for the smaller squares and also lacks the simplicity that might be 
desired. For a comparison between the means of two treatments A and 
B, we assign to each an effective amount of replication which depends on 
the number and situation of the missing values. When scoring A we 
examine each replicate in turn and assign a score by the following rules. 


Case Score to A 
I. A missing 0 
II. A present, and B in the same row or column as A 
i. B missing 0 
ii. B present fi 
III. A present, and B not in the same row or column as A 
j. Other values missing in both row and column My 
ii, Other values missing in row or column but not in 
both oP 


iii. No values missing in row or column 


360 LATTICE SQUARES 12.24 


The effective replication for A is of course its total score over all rep- 
lications in the experiment. B is scored similarly. Suppose that in the 
example the mean of treatment (18) were being compared with that of 
treatment (9), which occurs in the same row in square I. The effective 
replication for treatment (18) is 2, since case I arises in square I. The 
effective replication for treatment (9) is also 2, since case IT(i) arises in 
square I. 
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PLANS 
Plan 12.1 3 X 3 balanced lattice square 
t= 9, k =3,r = 2, rows = 6, columns = 6, \ = 1* 
Square I Square II 
1 2 3 1 6 8 
45 6 9 2 4 
i Seg 5 7 3 
Plan 12.2 4 X 4 balanced lattice square 
t= 16,k =4,r =5, rows = 20, columns = 20, \ = 2 
Square I Square II Square IIT 
il. Bibte (Oy 18 tn eet eis #4: Det 6) 6 
2 6 10 14 Geer. 87, I2sg22) 515 
Bare LL 1s 1 12 9 10 Naess 359 
4 8 12 16 1615 14 13 7 13 «10 «4 
Square IV Square V 
1 7 12 14 LORS eB: 
8 2 13 Il CY 
Ae en igh 16> 312 
15: “SONGS Pa Eure 


* Number of times that two treatments appear in the same row or column. 
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Plan 12.3 5 X 5 balanced lattice square 
t = 25, k =5,r = 3, rows = 15, columns = 15,4 = 1 
Square I Square IT Square IIT 
1-28 1 10 14 18 22 1 8 alba Tired 
6. 7 eB OO, Q3= 12) iGieto: A9) 25) 92) Dees 
It 120 V3 S1ae AS. 20 E24 Tia GAL 19 21 3 10 12 
16: 17 18 195220) 12 16 2 4 8 18 20 22 4 6 
21 22 23 24 25 9 138 17 21 5 7 14 16 23 5 
Plan 12.4 7 X 7 balanced lattice square 
t = 49,k = 7,7 =4, rows = 28, columns = 28,4 = 1 
Square I Square IT 

Fe ee ee eS i 1 38 26 14 44 32 20 

8 9 30) Th 12) 13) 14 21° 2 39 27. 8 45 88 

15 16 17 18 19 20 21 34 15 8 40 28 9 46 

22 23 24 25° 26 27 28 47 35 16 4 41 22 10 

29 30 31 32 33 34 35 11 48 29 17 5 42 28 

36 37 38 39 40 41 42 24 12 49 30 18 6 36 

43 44 45 46 47 48 49 37 25 18 48 31 19 7 

Square III Square IV 

1 19 30 48 10 28 39 1 42 27 12 46 31 16 

40 2 20 31 49 ll 22 17. 2 36 28 13 47 32 

23 41 38 21 32 43 12 33 18 3 37 22 14 48 

138 24 42 4 15 33 44 49 34 19 4 38 23 8 

45 14 25 36 5 16 34 9 48 35 20 5 39 24 

35 46 8 26 37 6 17 25 10 44 29 21 6 40 

18 29 47 9 27 38 7 41 26 11 45 30 15 7 
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8 X 8 balanced lattice square 


t = 64, k = 8, r = 9, rows = 72, columns = 72, \ = 2 


Square I 


Square IT 


Square IIT 


1 917 25 33 41 49 57 


1 10 19 28 37 46 55 64 


1 44 62 56 27 39 18 13 


2 10 18 26 34 42 50 58 


9 2 51 44 61 30 23 40 


46 217 35 16 53 60 31 


3 11 19 27 35 43 51 59 


17 50 3 36 29 62 15 48 


64 23 3 25 54 12 45 34 


4 12 20 28 36 44 52 60 


25 42 35 4 21 14 63 56 


55 40 29 4 58 41 11 22 


5 13 21 29 37 45 53 61 


33 58 27 20 5 54 47 16 


28 9 50 63 5 24 38 43 


6 14 22 30 38 46 54 62 


41 26 59 12 53 6 39 24 


37 51 15 42 20 6 32 57 


7 15 23 31 39 47 55 63 


49 18 11 60 45 38 7 32 


19 61 48 14 33 26 7 52 


8 16 24 32 40 48 56 64 


57 34 43 52 13 22 31 8 


10 30 36 21 47 59 49 8 


Square IV 
1 60 54 40 43 15 26 21 


Square V 


Square VI 


1 11 20 30 34 48 53 63 


1 59 52 38 42 16 29 23 — 


62 2 25 11 24 37 52 47 


15 2 56 45 59 28 22 33 


63 2 32 13 19 36 54 41 


56 31 3 41 38 20 61 10 


21 52 3 39 32 58 9 46 


53 28 3 47 40 18 57 14 


39 16 45 4 50 57 19 30 


26 47 38 4 17 13 64 51 


34 15 46 4 49 61 24 27 


44 17 34 55 5 32 14 59 


40 62 31 19 5 49 42 12 


48 22 39 51 5 25 10 60 


13 35 23 58 28 6 48 49 


43 25 61 16 55 6 36 18 


11 33 21 64 31 6 44 50 


27 53 64 22 942 7 36 


54 24 10 57 44 35 7 29 


30 56 58 17 12 43 7 37 


18 46 12 29 63 51 33 8 


60 37 41 50 14 23 27 8 


20 45 9 26 62 55 35 8 


Square VIT 


Square VIIT 


Square IX 


1 32 47 61 22 50 12 35 


1 14 24 31 36 45 51 58 


Ue2i3 45076 7 8 


29 2 14 49 39 64 43 20 


12 2 55 48 57 27 21 38 


1412161015 9 13 11 


4213 3 24 60 33 30 55 


22 49 3 37 26 63 16 44 


24 21 22 23 18 19 20 17 


59 5418 4 48 31 37 9 


32 43 33° 4 23 10 62 53 


31 27 26 32 30 29 25 28 


23 36 57 46 5 11 56 26 


35 64 30 18 5 52 41 15 


36 38 37 33 35 34 40 39 


52 63 40 27 10 6 17 45 


47 29 60 9 56 6 34 19 


45 48 44 43 41 47 46 42 


16 41 28 34 51 21 7 62 


50 20 13 59 46 40 7 25 


51 55 49 53 52 56 50 54 


38 19 53 15 25 44 58 8 


61 39 42 54 1117 28 8 


58 57 63 62 64 60 59 61 
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Plan 12.6 9 X 9 balanced lattice square 
t = 81, k = 9, r = 5, rows = 45, columns = 45, 4 = 1 
Square I Square IT 
1 ‘2. Vet: Gee Gre sao 1 12 20 34 45 53 58 69 77 
10: TL 12°43) Say AS SIGHTS As: 21 2 10 54 35 48 78 59 67 
19 20 21 22 23 24 25 26 27 11 19 3 44 52 36 68 76 60 
28 29 30 31 32 33 34 35 36 61 72 80 4 15 28 28 39 47 
87 38 39 40 41 42 43 44 45 81 62 70 24 5 18 48 29 37 
46 47 48 49 50 51 52 53 54 71 79 68 14 22 6 38 46 30 
55 56 57 58 59 60 61 62 63 81 42°50 55 66) 74 07 18) 26 
64 65 66 67 68 69 70 71 72 51 32 40 75 56 64 27 8 16 
73 74 75 76 77 78 79 80 81 41 49 33 65 73 57 17 25 9 
Square IIT Square IV 
1 57 29 16 72 44 22 78 50 1 33 62 27 47 76 14 43 66 
30 2 55 45 17 70 51 23 76 638 2 31 77 25 48 64 15 44 
56 28 3 71 43 18! 77 49-24 32 61 3 46 78 26 45 65 13 
25 81 53 4 60 32 10 66 38 17 37 69 4 86 56 21 50 79 
54 26 79 338 5 58 39 11 64 67 18 38 57 5 34 80 19 51 
80 52 27 59 81 6 65 37 12 39 68 16 35 55 6 49 81 20 
138 69 41 19 75 47 7 63 35 24 53 73 11 40 72 7 30 59 
42 14 67 48 20 73 36 8 61 74 22 54 70 12 41 60 8 28 
68 40 15 74 46 21 62 34 9 52 75 23 42 71 10 29 58 9 
Square V 
1 60 35 18 65 40 23 79 48 
36. 2 58 41 16 66 46 24 80 
59 34 3 64 42 17 81 47 22 
26 73 51 4 68 29 12 68 48 
49 27 74 30 5 61 44 10 69 
75 50 25 62 28 6 67 45 11 
15 71 37 20 76 54 7 57 32 
38 13 72 52 21 77 33 8 55 
70 39 14 78 53 19 56 31 9 
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11 X 11 balanced lattice square 


¢ = 121, k = 11,r = 6, rows = 66, columns = 66, \ = 1 


Square I 


5 


6 


16 


17 


27 


28 


38 


49 


60 


71 


82 


“93 


94 


95 


104 105 106 


110 


15 116 117 


121 


Square IT 


42 


22 


112 


63 


43 


12 


84 


64 


44 


85 


65 


115 


106 


116 


110 


121 


111 


Square IIT 


83 


76 


58 


91 


106 99 


114 107 


PLANS 
Plan 12.7 (Continued) 11 X 11 balanced lattice square 


Square IV 
1 40 68 107 14 53 81 120 27 66 94 
95 2 41 #69 108 15 54 82 121 28 56 
57 «(96 3 42 70 109 16 55 83 111 29 
30 58 97 4 43 71 110 17 45 84 112 


48 87 115 33 61 89 7 #35 74 102 20 


84 70 56 53 39 25 22 8 115 101 98 
99 85 71 57 54 40 26 12 9 116 102 


121 98 75 52 29 


6 
30 

87 1412 80 A BL. 8 106 83 60 
67 


8 62 39 16 114 91 68 45 33 10 108 
joo 86 63 40 17 115 92 69 46 23 il 
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Plan 12.8 13 X 13 balanced lattice square 
t = 169, k = 13, r =7, rows = 91, columns = 91, = 1 
Square I 

‘LPR QeeLS) PELL RG a TOO AOL LE) 2" IS 
14) 5! 16) 7 TAS 19) 20) 2122) 123) 24 25) 26 
27 «228 «629: «30 «81S 8288) 84 85 86 87 8888 
40 41 42 48 44 45 46 47 48 49 50 51 52 
a a 
ChMOT. LOS aOH Oe So) BEADED OG T7778 
79 90 81 82 8 84 8 86 87 88 89 90 9 
92 938 94 95 96 97 98 99 100 101 102 103 104 
105 106 107 108 109 110 111 112 113 114 115 116 117 
118 119 120 121 122 123 124 125 126 127 128 129 130 
131 132 133 134 135 136 137 138 139 140 141 142 143 
144 145 146 147 148 149 150 151 152 153 154 155 156 
157 158 159 160 161 162 163 164 165 166 167 168 169 

Square IL 

1 2% 388 50 62 74 86 98 110 122 184 146 158 
159 2 #14 #39 #61 G8 75 87 99 111 123 185 147 
148 160 38 15 27 52 64 76 88 100 112 124 136 
1387 149 161 4 16 28 40 65 77 89 101 113 125 
126 1388 150 162 5 17 29 41 53 78 90 102 114 
115 127 139 151 168 6 18 30 42 54 66 91 1038 
104 116 128 140 152 164 7 19 31 43 55 67 179 
80 92 117 129 141 153 165 8 20 382 44 56 68 
69 8i 93 105 130 142 154 166 9 21 338 45 57 
58 70 82 94 106 118 143 155 167 10 22 34 46 
47 569 71 .83 95 107 119 131 156 168 11 23 35 
36 48 60 72 84 96 108 120 132 144 169 12 24 
25. 37 «49 «661 «6738 «685 «97 «109 121 133 145 157 13 
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PLANS 


13 X 18 balanced lattice square 


Square IIT 
1 24 34 44 54 77 87 97 107 130 140 150 160 
161 2 2 385 45 55 78 88 98 108 118 141 151 
152 162 3 2% 36 46 56 66 89 99 109 119 142 
143 153 163 4 14 237 #47 «+57 67 90 100 110 120 
121 131 154 164 5 6 38 148 958 68 01 101 iit 
112 122 132 155 165 6 16 39 49 59 69 79 102 
103 113 123 133 156 166 7 17 12 150) “60! -70" 280: 
81 104 114 124 134 144 167 as a | 
72 82 92 115 125 1385 145 168 9 19 29 52 62 
638.73 83 93 116 126 136 146 169 10 20 30 40 
41. 64 74 &4 94 117 127 187 147 157 11 21 31 
32 42 65 75 85 95 105 128 188 148 158 12 22 
23. 383 43 «+53 76 86 96 106 129 139 149 159 18 
Square IV 

1 22 30 51 59 67 88 96 117 125 133 154 162 
163 2 23 31 52 60 68 89 97 105 126 134 155 
156 164. «3 «24 32 40 61 69 90 98 106 127 135 
736 144 165 4 2 338 41 62 70 M1 99 107 128 
129 137 145 166 5 26 34 42 63 71 79 100 108 
109 130 138 146 167 6 14 35 43 64 72 80 101 
102 110 118 139 147 168 7 15 36 44 65 73 81 
82 103 111 119 140 148 169 8 16 87 45 58 74 
7 93 104 112 120 141 149 157 9 17 88 46 (54 
$B 76 84 92 118 121 142 150 1588 10 18 39 47 
48. 56 77 85 98 114 122 1438 151 159 11 19 27 
28 49 «+57 78 86 94 115 123 131 152 160 12 20 
21. 29 «+50 58 66 87 95 116 124 182 153 161 18 
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13 X 18 balanced lattice square 


Square V 
1 20 39 45 64 70 89 95 114 120 189 145 164 
165 2 21 27 46 65 71 90 96 115 121 140 146 
147 166 3 22 28 47 #53 72 O91 97 116 122 141 
142 148 167 4 23 20 48 54 73 79 98 117 123 
124 143 149 168 5 “A 30 49 55 74 80 99 105 
106 125 181 150 169 6 25 31 50 56 75 81 100 
101 107 126 132 151 157 We 426° WBZ BL ST 76 «82 
83 102 108 127 1388 152 158 8 14 33 52 58 77 
78 84 108 109 128 134 153 159 9 15 34 #40 59 
60 66 85 104 110 129 1385 154 160 10 16 35 41 
42 61 67 86 92 111 130 1386 155 161 11 17 36 
37. 43 « -62)—« 68s 87)~=— 938s“: 112 «118 187 156 162 12 18 
19 38 44 63 69 88 94 113 119 138 144 163 13 
Square VI 

1 18 35 52 56 73 90 94 111 128 1382 149 166 
167 2 19 36 40 57 74 1 95 112 129 133 150 
151 168 3.20 37 #41 #58 75 79 96 113 130 134 
135 152 169 4 21 38 42 59 76 80 97 114 118 
119 186 153 157 5 22 39 43 60 77 81 98 115 
116 120 187 154 158 6 23 27 44 61 78 82 99 
100 117 121 138 155 159 7 24 28 45 62 66 88 
84 101 105 122 189 156 160 8 25 29 46 63 67 
68 85 102 106 128 140 144 161 9 26 30 47 64 
65 69 86 103 107 124 141 145 162 10 14 31 48 
49 53 70 87 104 108 125 142 146 163 11 15 32 
33. 50 54 71 88 92 109 126 143 147 164 12 16 
W734 451 #55 72 89 98 110 127 181 148 165 13 


Plan 12.8 (Continued) 


13 X 13 balanced lattice square 


PLANS 


Square VIT 

1 16 31 46 61 76 91 93 108 123 188 153 168 
169 2 17 #32 #47 #62 %77 j%79 94 109 124 139 154 
155 157 3 18 33 48 63 78 980 95 110 125 140 
141 156 158 4 #19 #34 #49 64 66 81 96 111 126 
127 142 144 159 5 20-35 50 65 67 82 97 112 
113 128 143 145 160 6 21 36 51 53 68 83 98 
99 114 129 131 146 161 7 #22 37 «+52 54 69 84 
85 100 115 1380 182 147 162 8 23 38 40 55 70 
71 86 101 116 118 183 148 168 9 24 39 41 56 
57 #72 87 102 117 119 134 149 164 10 25 27 42 
43 58 73 88 103 105 120 135 150 165 11 26 28 
29 44 59 74 89 104 106 121 136 151 166 12 14 
15 30 45 60 75 90 92 107 122 187 152 167 13 
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CHAPTER 13 


INCOMPLETE LATIN SQUARES (YOUDEN SQUARES) 


13.1 Description 


13.11 Youden Squares. These designs, which are constructed by a re- 
arrangement of certain of the balanced incomplete blocks, possess the 
characteristic “double control” of the latin square, without the restric- 
tion that the number of replicates must equal the number of treatments. 
A latin square with 13 treatments is rarely used, because it necessitates 
13 replications. There are, however, incomplete latin squares for 13 
treatments in either 4 or 9 replicates. Every treatment occurs once in a 
column (replication), and every pair of treatments appears together an 
equal number of times in the same block. Column differences are 
eliminated automatically from the treatment comparisons, while block 
differences may be removed by adjusting the treatment mean yields in 
the same way as with balanced incomplete blocks. Most of the designs 
were developed by Youden (13.1, 13.2), whose name is commonly as- 
sociated with them; previously Yates (13.3) had drawn attention to the 
group of designs in which the number of replicates is one less than the 
number of treatments. Table 13.1 summarizes the designs for numbers 
of treatments up to 37. 


TABLE 13.1 Summary oF iNcoMPLETE LATIN SQUARES (7 less than ¢) 


t prek 2X E Typet t r=k »X E Type t 
4* 3 2 89 IL 13 4 1 81 I 
5* 4 3 94 IL 13 9 6 95 I 
6* 5 4 96 IL 15 7 3 92 I 
uf 3 1 78 IL 15 8 4 94 I 
fe 4 2 88 IL 16 6 2 89 I 
ved 6 5 97 II 16 10 6 96 I 
8* cf 6 98 IL 19 9 4 94 I 
g* 8 “i 98 IL 19 10 5 95 I 
10 * 9 8 99 Il 21 5 il 84 I 
Vik 5 2 88 I 31 6 1 86 it 
11 6 3 92 I 37 9 2 91 I 


ies. 16 9 OS Il 


* Constructed from a t X ¢ latin square by omission of the last column, 
} This refers to the method of analysis (see section 13.2). 
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The number of replicates is always equal to the number of units per 
block. The symbol \ shows the number of times that two treatments 
appear together in a block, while Z is the efficiency factor expressed in 
percentage. 

Plans are given (plans 13.1—-13.15) for all designs in table 13.1 except 
those marked with an asterisk (*), which are constructed from latin 
squares by the omission of the last column. A 5 X 5 square, for in- 
stance, furnishes the plan for ¢ = 5, r = 4. 

An interesting application of the design for the control of plant vari- 
ability was made by Youden (13.1) in greenhouse experiments on to- 
baceco-mosaic virus. The experimental unit was a single leaf, and the 
data consisted of the number of lesions produced per leaf by rubbing the 
leaf with a solution which contained the virus. The numbers of lesions 
had been found to depend much more on inherent qualities of the plant 
than on the position of the plant on the greenhouse bench. Conse- 
quently, each block of the design was a single plant, so that the large dif- 
ferences in responsiveness which existed among plants did not contribute 
to the experimental errors. The columns were the positions, from top 
to bottom, of the five leaves which were used on each plant. That: is, 
the first replication contained the top leaf of every plant. Since there 
was a fairly consistent gradient in responsiveness down each plant, this 
control also proved effective. 

As in the example above, when laying out an incomplete latin square 
the general principle is to group the units so that differences among 
blocks and differences among columns represent the major sources of 
variation that are known or suspected. 


13.12 Randomization. The steps are 


1. Rearrange the blocks of the plan at random. 
2. Rearrange the replications of the plan at random. 


When a plan is repeated for additional replication, blocks and columns 
are randomized separately within each repetition. The repetitions may 
be kept separate. 

13.2 Statistical Analysis 


In experiments with more than 10 treatments, the numbers of degrees 
of freedom for blocks are large enough to allow the use of inter-block in- 
formation. The appropriate analysis is described in the next section. 
For small experiments, where inter-block information should be ignored, 
the analysis is given in section 13.22. The recommended method of 
analysis for any experiment is indicated both in table 13.1 and in the 


plans by the “type” symbol (I or II). 
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13,21 Type I. Analysis with Recovery of Inter-block Information. 
The steps are as follows (¢ = number of treatments, & = number of units 
per block = r). 

1. Calculate the treatment totals 7, the column totals, the block 
totals, and the grand total G. 

2, For each treatment, calculate the total B, of all the blocks which 
contain the treatment. Place these quantities in a column next to the 
column of treatment totals. Then forma third column of the quantities: 


W=(¢—-hKT-C¢-1B+(k-DE 


The W’s should sum to zero. 
8. The separation of degrees of freedom in the analysis of variance is 
as follows. 


df. m.s. 
Columns (replications) (k — 1) 
Blocks (eliminating treatments) (t — 1) Ep 
‘Treatments ¢—1) 
Error (k — 2)(¢ — 1) Es 
Total (tk — 1) 


The total s.s. and the sums of squares for columns and treatments are 
found by the usual methods. The sum of squares for blocks (eliminating 
treatments) is the sum of squares of the W’s, divided by kt(¢ — k)(k ~ 1). 
The error s.s. is obtained by subtraction. 
4, Calculate the factor 
aa. (EZ, — E) 
i(k — 1)Ey 


The adjusted total for any treatment is 
Y=7+.W 


Should F, be Jess than H., » is taken as zero and no adjustments are 
made to the treatment totals, The adjusted treatment means are ob- 
tained on dividing each Y by r. The estimated error variance of the dif- 
ference between two adjusted treatment means is 


2E, 
pelt + (t =r) 


13.22 Type II. Analysis without Recovery of Inter-block Information. 
The analysis proceeds as follows. 
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1. Calculate the column totals, the block totals, the treatment totals 
T, and the grand total G. 
2. For each treatment, obtain the quantity 


Q=kT —B, 


where B; is the total of all the blocks in which the treatment appears. 
The quantities Q should sum to zero. 

3. The degrees of freedom in the analysis of variance are partitioned 
asin section 13.21. In this case the blocks s.s. is calculated without elim- 
inating treatment effects, while the treatments s.s. is adjusted for block 
differences. The total s.s. and the sums of squares for blocks are com- 
puted by the standard procedure. The treatments s.s. (adjusted) is the 
sum of the squares of the Q’s, divided by tka. The error s.s. is found by 
subtraction. 

4, In order to obtain any treatment mean, adjusted for block dif- 
ferences, divide the corresponding Q by ‘ and add to the quotient the 
mean for the whole experiment. 

The estimated error variance of the difference between two adjusted 
treatment means is 


2E,.(t — 1) 
(r— Dé 


Numerical examples of this analysis for ¢ = 7, r = 3, and ¢ = 21, 
+ = 5 are given in reference (13.1), and an example for ¢ = 6, r = 5 in 


(13.8). 


18.23 Type Ia. Repetitions of Type I. Let n be the number of repli- 
cations in the basic plan, which is used p times in an experiment, so that 
the total number of replications 7 = np. 

The method of analysis presented here may be used with any of the 
designs in table 13.1, since even with small numbers of treatments the 
repetition provides sufficient degrees of freedom for estimating the inter- 
block variation. If, however, the efficiency factor exceeds 95%, it is 
scarcely worth while to utilize inter-block information. 

The changes necessary in the procedure of section 13.21 are outlined 
below, the numbers referring to the steps in that section. 

1. Unchanged. Corresponding to any block, there are (p — 1) other 
blocks which contain the same set of treatments. The block totals 
should be arranged in a table (table A, say) with ¢ rows and p columns. 

2. Unchanged. Note that the calculation of the quantities B; is ex- 
pedited by the use of the row totals of table A. 
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3. The analysis of variance is as follows. 


df. m.s. 
Columns (replications) (kp — 1) 
Blocks 
Component (a) r= HE 1) 
Component () ( —1) 
Block total p(t — 1) Ey 
Treatments (t—1) 
Error (t — 1)(pk — p — 1) E, 
Total (pkt — 1) 


In this analysis, the (py — 1) degrees of freedom among repetitions 
have been ascribed to the columns, so that the blocks s.s. is actually a 
sum of squares among blocks within repetitions. 

Component (a) of the sum of squares for blocks consists of comparisons 
among blocks which contain the same set of treatments and is the inter- 
action s.s. for table A. 

Component (b) is the sum of squares of the W’s, with a divisor 
pkt(t — k)(k — 1). 

4, Unchanged except that 

p(Ey — Ee) 
pik — 1), — (t— k)(p — 1)E, 

The estimated error variance of the difference between two adjusted 

treatment means is 


2E, 
=F (L + (@ — n)u] 


13.24 Type Ila. Repetitions of Type Il. As we have pointed out, 
in designs where the efficiency factor is high, inter-block information 
can be neglected. The changes required from section 13.22 should 
present no difficulty. The degrees of freedom subdivide as in section 
13.23. The p(t — 1) degrees of freedom among blocks within repetitions 
are calculated in the usual way, each repetition furnishing (f — 1) de- 
grees of freedom. Notice that the divisors for the treatments s.s. and 
for the quantities Q must be increased by the factor p. 


13.25 Missing Data. The formula for inserting an estimate in place 
of a missing observation is obtained by minimizing the intra-block error. 
This is the correct estimate for the smaller experiments where inter- 
block information is ignored. For reasons given in section 10.18, this 
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estimate will also be used to provide an approximate solution when 
inter-block information is recovered. The formula is quite similar to that 
for balanced incomplete blocks, with the addition of an extra term in- 
volving the column (or replication) total. 

Let C, B, and T be the totals of the column, block, and treatment that 
contain the missing value, and let G be the grand total. Further, as in 
the statistical analysis, let B, be the total of all blocks in which the 
treatment with the missing value appears (there will, of course, be r such 
blocks). Finally, 


7’ = total (over all replicates) of all other treatments that appear in 
the block which has the missing value 
B/ = total of the B; values for all other treatments that appear in the 


block with the missing value 
The estimate x of the missing value is 


MrC + 1B+ ¢-— DT -— G]—rT'-(r- 1)B, + By 
r(r — 1)(r — 2) 


The symbol A is given for each design in table 13.1. Care must be 
taken not to confuse the block totals with the column totals, 

If the experiment contains p repetitions, the formula is 

plpkaC + piddsB + k(k — 1)T — prR — kT! — (k — 1)Bi + Bi] 
2= 
r(k — 1)(pk — p — 1) 

where R is the total of the repetition in which the missing value occurs, 
and ) is as in table 13.1. 


143.3 Other Designs for Small Numbers of Treatments 


13.31 Description. When the number of treatments is small, it is 
sometimes useful to have a design of the “Jatin square” type in which 
the number of replicates exceeds the number of treatments. Some de- 
ined by repetition of an ordinary latin square or of a 
Youden square. Additional plans can be constructed by adding a You- 
den square to a latin square. A selection of these designs is shown in 
table 13.2 for numbers of treatments up to ae 

Tt will be noted that plans are available for most numbers of replicates ~ 
up to 10. Actually, plans can be made for any number of replicates up 
to 10; those shown here have been selected for ease of analysis. 


signs can be obtai 
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TABLE 13.2 INcoMPLETE LATIN SQUARES FOR SMALL NUMBERS OF TREATMENTS 


(r >t) 

Ref. to Ref. to 
t k th E plan Type ft t EF E plan Type ft 
3 5 5 965" 13.165 "TTF 4 9 9 99 ©6«:13.22) IV 
3 3 6 100 2LS. 4 5 610 96 13.20 Ta 
3 7 7 98— eles?) CIV 5 6 6 97 138.23 IV 
3 8 8 98 5518518, TE 5 4 8 94 - Ta 
3 3 9 100 3LS. 5 9 9 99 13.24 IIT 
3 5 10 96 13.16 a 5 6 10 100 2LS8. 
QerkOs 10: 99) 13.10) CE 6 7 if, 98 13.25 IV 
4 5 5 96 13.20 IV 6 5 (10 96 = Ta 
4 3 6 89 : Ta io 4 8 88 13.2 Ta 
4 7 7 98 13:21 DT a 8 8 98 13.26 IV 
4 4 8 100 2LS. 7 3 9 78 13.1 Ta 
4 3 9 . 89 4 Ta 


* By repetition of the plan for r = 1 — 1, which is constructed by taking at X ¢ 
latin square and omitting the last column. 
} This refers to the method of analysis (see section 13.2). 


TABLE 13.3 INcoMPLETE LATIN SQUARE FoR 4 TREATMENTS IN 7 REPLICATES 


Treatment symbols and yields of tomatoes (pounds) 


Column 
Block I II IIL IV vi VI VII ‘Total 

1 2 2 4 4 3 3 1 

50 72 83 82 76 89 74 526 
2 1 3 3 if 4 2 4 

40 59 71 91 59 73 52 445 
3 4 1 2 3 bk 4 2 

43 57 58 98 54 7 51 432 
4 3 4 1 2 2 1 3 

48 54 74 97 75 75 54 477 


Totals 181 242 286 368 264 308 231 1880 


Adjusted means 
T Q=kT+B’ Q+48 (subtract 11.2) 


1 465 3,781 78.8 67.6 
2 476 3,777 78.7 67.5 
3 495 3,897 81.2 70.0 
4 444 3,585 74.7 63.5 


Check totals or mean 1,880 15,040 67.1 


OE LL 
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13.32 Type Ill. Statistical Analysis When k = it — 1. In this case 
the size of block, which equals the number of replicates, is 1 less than 
some multiple (2) of the number of treatments. Since these designs have 
not been discussed in the literature, an example is given from uniformity 
data. Table 13.3 shows the tomato yields in pounds of 28 single-row 
plots, Hartman and Stair’s data (13.4), on which the design for 4 treat- 
ments in 7 replicates has been superimposed. 

Since the plots measured 6 feet X 24 feet, each column (replication) 
is compact, being 24 feet square. There are obviously substantial dif- 
ferences among replicates. The blocks, although very oblong, may also 
exhibit differences in yield, because each block is a separate row of plants. 

The efficiency factors of these designs are all very high, so that inter- 
block information is ignored in the analysis. 

The steps in the analysis are as follows: 

1. Calculate the column totals, the block totals, the treatment totals 
T, and the grand total G. 

2. A property of these designs is that each treatment, is replicated less 
in one block than in the other blocks. Thus, in the example, treatment 
(1) appears only once in the first block, but twice in all other blocks. 
Similarly, treatment (2) is deficient in block 2 and so on. For each 
treatment calculate the quantity 

Q=kT+B' 
where B’ is the total for the block in which the treatment is deficient. 


For instance 
Q: =7 X 465 + 526 = 3781 


‘As a check, the quantities Q should sum to (k + DG. 
3. All sums of squares in the analysis of variance are obtained in the 


usual way except that for treatments, which is given by the sum of 
squares of deviations of the Q’s, divided by k(k? — 1). In this case we 


have 
(3781)?.-+ (3777)? + (3897) + (3585)? — $(15,040)? _ a 
336 
The analysis of variance is shown below. The degrees of freedom sub- 
divide as in section 13.21. 


df. 8.8. m.8. 

Columns (replications) 6 5387 897.8 

Blocks 3 750 250.0 

Treatments 3 150 50.0 

Error 15 766 51.1 
Total 27 7053 
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The elimination of block differences has substantially reduced the 
error m.s. 

4. To obtain the adjusted treatment mean yields, we first divide each 
Q by (k? — 1), in this case 48. From the resulting quantities we sub- 
tract G/tk(k — 1), or 1880/168 = 11.2. As a further check, the mean 
of the adjusted treatment means should equal the mean yield of the 
whole experiment (67.1). The efficiency factor is (k? — 1)/k, and the 
estimated error variance of the difference between two adjusted treat- 
ment means is 

2H r 


rl 


13.33 Type IV. Statistical Analysis When k = if+1. The analysis 
closely resembles that of section 13.32. For each treatment there is one 
block in which the treatment has extra replication. Thus, in plan 13.17 
fort = 3,r = 7, treatment (1) appears 3 times in the first block but only 
twice in any other block. The only changes in the computing instruc- 
tions of section 13.32 are (i) Q = kT — B’, where B’ is the total of the 
block in which the treatment has extra replication, (ii) the Q’s sum to 
(& — 1)G@, and (iii) for the adjusted treatment means, divide Q by 
(k? — 1), and add to the quotient the quantity G/tk(k — 1). The change 
in the divisor of G should be noted. 

The error variance of the difference between two adjusted means re- 
mains as in section 13.24, 
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CHAPTER 14 


ANALYSIS OF THE RESULTS OF A SERIES OF 
EXPERIMENTS 


14.1 Initial Steps in the Analysis 


44,11 Introduction. In a program of research it is quite common to 
repeat the same experiment at a number of different places, on a number 
of different occasions. There may be several reasons for this. Some- 
times the object of the research is to produce recommendations which 
are to apply to a population that is extensive either in space or in time 
or in both. Thus in agricultural field experimentation, many projects 
are undertaken in the hope that their results can be applied in practical 
farming. The conclusions drawn from such research, if they are to be of 
use, must be valid for at least several seasons in the future and over a 
reasonably large area of farm land. It has been found that the effective- 
ness of the common plant nutrients, of different varieties of a crop, and 
of different cultivation practices usually varies from field to field and, 
even more markedly, from season to season. A single experiment, how- 
ever well conducted, supplies information about only one place and one 
season. Consequently such experiments are carried out at several dif- 
ferent places in the area for which recommendations are wanted, and are 
repeated for a number of seasons. 

Tn other cases we may be interested, not in making inferences about 
some specific population, but in studying the influence of external con- 
ditions on some measurement or on the responses to treatments. For 
example, is the vitamin ‘A content of a vegetable affected by the climate 
in which it is grown? Do the relative durabilities of two types of road 
surface depend on weather or topography or traffic load? Repetition of 
such experiments in different places is necessary in order to have vari- 


the external factors that are under investigation. 

by collaborative experiments on biological 

Here the objects may be to obtain in- 

formation about the accuracy V ith which the potency of a drug can be 

estimated and to discover whether different laboratories reach the same 

about the relative potencies of different drugs. Bliss (14.1) 
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ations in 
‘A third example is supplied 
assay by a group of laboratories. 


conclusions 
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gives an interesting account of a series of experiments of this type, where 
sixteen laboratories estimated the potency of preparations of digitalis by 
injection into cats. 

The appropriate statistical analysis for the data from a series of ex- 
periments will of course vary with the object of the research. Never- 
theless, the preliminary stages of the analysis tend to be the same in all 
cases. In this chapter an introductory account is given of these initial 
procedures, which present difficulties that are not always appreciated. 
The first step, which is to analyze and interpret the data from each in- 
dividual experiment, will be assumed to have been completed. 

When we begin to combine the data, the first point of interest is to 
examine whether the differences among treatments are the same in all 
experiments. This question is likely to be relevant whatever the purpose 
of the experiments. In experiments designed to lead to the recommenda- 
tion of a “best” treatment for some operation, we wish to know whether 
there is a consistent superiority of certain treatments, or whether on the 
other hand we may have to consider recommending different treatments 
for different circumstances. In the other two types of experimentation 
mentioned above, the test would indicate whether the responses to treat- 
ments have varied with the external conditions of the experiment, or 
whether the different laboratories agreed in their estimates of relative 
potency. 

Secondly, it is often, though not always, desirable to estimate and 
compare the average effects of treatments over the whole series of ex- 
periments. 


14.12 Numerical Example. The data come from a group of 6 experi- 
ments on Irish potatoes conducted in two counties of North Carolina in 
1945 and 1946. The results have been described by Nelson and Hawkins 
(14.2), and form part of a study of the responses to applications of super- 
phosphate on soils of varying degree of fertility. In 1945, on each of the 
6 sites, the amount of readily soluble phosphorus in the soil itself was 
estimated from soil samples by the modified Truog method. These 
amounts varied from 48 to 850 lb. P.O; per acre. The treatments com- 
prised 5 different levels of application, 0, 40, 80, 120, and 160 lb. P20; 
per acre. 

These experiments are an example of the second type discussed above, 
in that their object was to find out the extent to which the responses to 
treatments were influenced by the condition of the soil. The soils were 
not intended to be a representative sample of the soils in the counties, 
but were chosen so as to give a wide range in fertility. 
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Potatoes were grown on the plots in 1945 and 1946, the treatments 
being applied in both years, with in addition substantial applications of 
nitrogen and potash on all plots so that these nutrients would not be 
deficient. The data to be analyzed are the 1946 results. The mean yields 
and error m.s. per plot are shown in table 14.1. In five of the six experi- 


TABLE 14.1. Treatment yiecps (100 us. per acre) or IrisH poratous, 1946 


Experiment number 


Pounds P205 1 22 o98 4 5 6 
applied Amount of readily soluble phosphorus 
per acre (pounds P205) in soil, 1945 


48 310 410 710 790 850 
0 114 142 180 170 130 170 

40 206 176 188 171 140 178 

80 231 201 207 188 150 187 

20 

60 


1 937 | 205 | 208 | 185 | 152 | 188 
1 252 | 217 | 217 | 189 | 156 | 189 
s?* 106 | 83 | 158 | 428 | 204 52 
df. 24 12 12 12 12 12 
rt 4 5 5 5 5 5 


* Error m.s. per plot. 
+ Number of replications. 


ments the F-ratios for treatments against error were significant. One 
feature of the results in table 14.1 is that experiment 1, which has the 
lowest amount of available phosphorus in the soil according to the soil 
tests, actually gave the highest total yield in 1946. It should be re- 
membered that the soil tests were made in 1945, and that some residual 
effects of the 1945 applications may have persisted. Whatever the rea- 
son, this soil was apparently very responsive to phosphorus in 1946; the 
plots without phosphorus do have the lowest yield of all places. 

It is possible to compute from table 14.1 a combined analysis of vari- 
ance for all six experiments. As will be seen later, there are limitations 
to the use of such an analysis. For the present we will ignore any com- 
plexities that may arise. Further, in order to discuss only the simplest 
case at first, we will omit experiment 1, which differs in size and structure 
from the other experiments. This omission is of course inappropriate 
from the agronomic point of view, but the data are being used to throw 
light on the general procedure in analysis rather than on the agronomic 


questions involved. 
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14.18 Preliminary Combined Analysis When all Experiments Have the 
Same Design. Formally, the analysis subdivides into the following 
components. 

Places 

Treatments 

Treatments X places 

Pooled experimental error 


In addition, in a complete analysis there are terms representing the dif- 
ferences among rows and columns in the individual experiments. Since 
these are not relevant to the interpretation, they are not included. 

The analysis can be computed from either treatment means or totals; 
the totals will be used here and are shown in table 14.2. 


TABLE 14.2 TreatMEnr TOTALS FOR EXPERIMENTS 2 To 6 (IN 100 LB.) 


Pounds Experiment 
POs | 2 3 4 5 sual as 
0 710 900 850 650 850 | 3,960 
40 880 940 855 700 890 | 4,265 


80 1,005 | 1,035 940 750 935 | 4,665 
120 1,025 | 1,040 925 760 940 | 4,690 
160 1,085 | 1,085 945 780 945 | 4,840 


Total | 4,705 | 5,000 | 4,515 | 3,640 | 4,560 | 22,420 


The first 3 components are computed in the same way as for a single 
randomized blocks experiment, except that all sums of squares are di- 
vided by an extra 5 in order to reduce them to a single-plot basis. Thus 
we have 
(4705)* +--+-+ (4560)? ct (22,420)? 


Places: = 
25 125 eae 
cares 2 2 
rentuenta ee ot 2) = 20,979 


To obtain the treatments X places s.s. we calculate the total s.s. for 
table 14.2, which comes to 69,199. Then 


Treatments X places: 69,199 — 41,367 — 20,979 = 6853 


Since all experiments have the same number of error degrees of free- 
dom, the pooled error m.s. may be found as the simple average of the 
five s? values in table 14.1. This comes to 108, as given in table 14.3. 

The null hypothesis that the treatment differences are the same at all 
places (i.e., that there are no treatment X place interactions) is tested 
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by the F-ratio 428/108, or 3.96, with 16 and 60 d.f., respectively. Since 
the 5% level is 1.81, the ratio is definitely significant. 


TABLE 14.3 ComMBINED ANALYSIS OF VARIANCE (ON A SINGLE-PLOT BASIS) 


Source of variation df. 8.8. m.8. 
Places. 4 41,367 
Treatments 4 20,979 5,245 
Treatments X places 16 6,853 428 
Pooled error 60 6,480 108 


‘A test of the average responses to treatments taken over all five places 
is of minor interest in this example, because the places do not constitute 
a random sample from a population about which we wish to make in- 
ferences. For purposes of illustration, however, we will assume that the 
places were selected as a random sample of fields on which potatoes might 
be grown commercially. 

In the F-test of the average effects of treatments there are two possible 
candidates for the denominator of F—the mean square for the treat- 
ments X places interactions (428) or the pooled error m.s. (108). Some- 
times this competition does not arise, because our knowledge of the data 
and the F-test of the interactions both indicate that there is no reason to 
suppose real interactions to be present. In that event the mean squares 
for interactions and error may be pooled to form a single denominator 
for the F-test of treatments. But frequently the experimental conditions 
are such that we expect interactions to be present. This is so in the ex- 
ample, where previous work in a number of countries has shown a rela- 
tion between the response to phosphorus and the amount found in the 
il test. Consequently, even if the interactions m.s. had not 


soil by a so! 
unwilling to assume that there 


proved significant, we might have been 


were no interactions. 
In discussing the two F-ratios it is helpful to examine the mathematical 


model on which the combined analysis is based. If 2; is the observed 
mean for the jth treatment at the th place, we postulate that 


ayy = wt om Hg mag + (14.1) 


represent the effects of the place and the treatment, respec- 
treatment X place interaction, and @; that of the 
‘ors on the 7 plots that re- 


where 73, 7; 
tively, y;; that of the 
experimental error. @j is the average of the err 
ceive the treatment at that place. 
With this model we can study the 
analysis of variance. If the experime 
a variance o-”, and if the interaction terms pi; 


nature of the mean squares in the 
ntal errors on individual plots have 
may be considered to have 
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a variance o,”, the average values of the mean squares work out as fol- 


lows. 
TABLE 14.4 ExPEcTED VALUES OF MEAN SQUARES 


Source of variation Expected value of mean square 
Treatments oe + 10,7 "STE 7 (3-7)? 
Treatments X places oe + Toy? 

Pooled experimental error oe 


The symbols 7, p, and ¢ stand respectively for the numbers of replica- 
tions, places, and treatments. 

Tt will be noted that the treatments m.s. is influenced by three com- 
ponents—the experimental error variance, the variance of treatment 
place interactions, and the variance among the true treatment means 
7;. The pooled experimental error takes account of only the first of these 
components. Hence the treatments m.s. may be statistically significant, 
as compared with the pooled error m.s., either because there are real dif- 
ferences among the 7’s or because treatment X place interactions are 
present. This /-test is informative only when we are indifferent as to 
whether a significant treatments m.s. was due to real treatment dif- 
ferences or to interactions. This situation is rare in practice. In most 
cases, on the contrary, having established that interactions are present, 
or at least that it is not safe to assume them absent, we wish to know 
whether in addition there are consistent differences among the effects of 
treatments. As table 14.4 shows, the appropriate denominator of F for 
this test is the interactions m.s. This contains both the interaction and 
error components of variation in exactly the same way as they enter into 
the treatments m.s. The F-ratio, 5245/428, or 12.26, with 4 and 16 d.f., 
is significant at the 1% level. 

The conclusions from this initial analysis are: (i) there are real dif- 
ferences in response that are consistent from place to place, and (ii) there 
are real variations in responsiveness from place to place. It need not be 
stressed that these statements do not constitute a competent summary of 
the results. They merely indicate what should be examined next. In 
the present case we would consider which treatments have proved con- 
sistently superior, and, if possible, why this is so. Similarly, we should 
investigate the nature of the interactions and try to find a rational ex- 
planation for them. 


14.2 Criticisms of the Preliminary Analysis 


14.21 Heterogeneity of the Interaction Variance. The previous anal- 
ysis is open to several criticisms. Although these criticisms are not valid 
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for all series of experiments, our experience is that it is well to be on one’s 
guard against them. They deal essentially with the assumptions on - 
which the combined analysis was based. 

These assumptions have been presented in section 14.13. Specifically, 
we postulate that the model (14.1) holds. Further, the experimental 
errors @;; of individual observations are assumed to be normally and in- 
dependently distributed with the same variance a.*. Finally, for the 
F-test of the treatments m.s. against the treatments X places inter- 
actions, we require also the assumptions that the interaction terms 4,; 
are normally and independently distributed, with zero population means 
and the same variance o,”, and are independent of the e’s. 

The first criticism is that some components of the treatments X places 
s.s. may be much larger than others, or in mathematical terms that the 
‘Gnteraction” variance o,” is not constant. This will happen if the ef- 
fectiveness of some treatments varies greatly from place to place, while 
that of others varies less or not at all. 

If the interaction m.s. is heterogeneous in this sense, the F-test of 
treatments against interactions is vitiated. The general effect is that the 
F value read from the tables is too low, i.e., that too many significant re- 
sults are obtained. Some idea of the extent of the bias can be obtained in 
certain extreme cases. In the numerical example, if one comparison 
among the treatment means has a much larger interaction variance than 
any other comparisons, the /’-ratio is distributed approximately as an 
F value with 1 and 4 d.f., respectively, instead of 4 and 16 d.f. The 5% 
significance level would be 7.71 instead of 3.01. More generally, if there 
are p places, F is distributed approximately with 1 and (p —1) df. 
This situation produces about the greatest distortion in F’ that is likely 
to arise, so that 7.71 could be regarded as an upper limit to the signifi- 
cance level of F. 

The exact distribution of / can be worked out, but is not available in 
a form adapted for practical use. Even so, in many instances, uncer- 
tainty about the correct significance level for F does not preclude us from 
drawing conclusions from the test. For, in the example above, if F 
turned out to be 1.38 we would be confident that it is not significant, since 
the 5% level is at least 3.01. Similarly, there seems little doubt that the 
F value actually obtained in the experiments, 12.26, is significant, since 
it lies well above 7.71. On the other hand, we would be uncertain about 
significance if F happened to lie between 3 and 7. 


‘A better method of coping with this difficulty is to divide the treat- 


ment s.s. into a set of orthogonal components that will supply all or most 
The interactions s.s. is partitioned 


of the information of interest to us. ; 
in the same way so as to isolate the interactions of each component with 


places. By Bartlett's test for homogeneity of variance, reference (14.3), 
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we can then test whether o,” is the same for all the components of the 
interactions s.s. If we decide that o,? can be assumed constant, the 
difficulty vanishes. If o,” is not constant, it is valid to test any com- 
ponent of the treatments s.s. against its own interaction with places. 
Apart from the extra computations, the only drawback to this pro- 
cedure is that the degrees of freedom in the denominator of F are reduced. 

These remarks will be illustrated by the example. In these experi- 
ments the principal point of interest is to see whether the average re- 
sponse to a given amount of phosphorus decreases when the amount 
presumed to be in the soil increases. It may also be worth while to ex- 
amine whether the rate of decline in response with the higher levels of 
dressing changes with the nature of the soil. These questions can be 
considered conveniently by fitting at each place a parabolic regres 
of the yield on the amount of dressing. Although a parabolic regression 
would scarcely be regarded as the true form of the response curve, it ap- 
pears to fit the data well. 

Since the dressings increase by equal amounts, the orthogonal poly- 
nomials given by Fisher and Yates (14.4) are suitable. The calculations 
are shown below for experiment 3. The linear term is proportional to 


Multipliers for 


—— 
Amount Total Linear Quadratic 
of P205 yield term term 
0 900 —2 2 
40 940 -1 -1 
80 1035 0 —2 
120 1040 1 =1 
160 1085 2 2 
Sum of products 470 —80 
Divisor for square 50 70 


the average increase in yield per 40 lb. of P20;: the quadratic term may 
be interpreted as measuring the rate of decline in response with increased 
dressings. The values found for the two terms are as follows. 


Experiment 
2 3 4 5 6 Total 


L 895 470 260 320 240 2185 
Q | —325 —80 0) —100 —110 | —685 


The contributions to the treatments s.s. are 


(2185)? 2 


i 685 
Linear: = 19,097; quadratic: sea = 1341 
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In the treatments X places s.s. we have 


(895)? +--+ ++ (240)? 
L X places: a 60) 19,097 = 5893 


(325)? +--+ (110)? 
70 


In table 14.5 the three interaction mean squares are in the same order of 
size as the corresponding treatment mean squares. This result is 
typical, in that large effects tend to have large interactions. It is obvious 
on inspection that the components of the interaction m.s. cannot be 
regarded as homogeneous. Accordingly, we test the linear component . 
of treatments against the mean square 1473 instead of against the com- 
plete treatments X places m.s. of 428 in table 14.3. This makes a sub- 
stantial difference to the F-ratio, though it remains significant. The 
quadratic X places m.s. (161) is not significantly above the error m.s. 
(108), but it seems prudent to use it as the denominator in the F-test of 
the quadratic term in treatments. The remainder of the interactions 
shows no sign of significance, and could be pooled with error for an /’- 


test of the deviations from regression. 


1341 = 645 


Q X places: 


TABLE 14.5 SuBDIVISION OF THE ANALYSIS OF VARIANCE 


Source of variation df. 8.8. ms. 
Treatments 
Linear 1 19,097 19,097 
Quadratic iE 1,341 1,341 
Deviations from regression 2 54 270 


Treatments X places 


Linear X places 4 5,893 1,473 
Quadratic X places 4 645 161 
Deviations X places 8 315 39 

60 6,480 108 


Pooled error 


erimental Error Variances. A second 
criticism concerns the assumption that the experimental error variances 
o2 are the same in all experiments. In general this assumption will hold 
only if all experiments have been conducted in the same way, with the 
game amount of control over environmental conditions and with experi- 


mental material of the same variability. In experiments with crops or 


animals this degree of uniformity is seldom attainable, because the 
among animals at one place 


natural variability among pieces of land or 
differs from that at other places. Hence in this and many other types of 
cooperative experimentation we expect a priort that experimental error 


variances change from place to place. 


14,22 Heterogeneity of the Exp 
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When there is doubt about this point, Bartlett’s test of homogeneity 
of variances can be applied tod the error m.s., s*, in the experiments. In 
the example (table 14.1) the s* values “pel from 43 in exper’ He 4 to 
204 in experiment 5, and the test of homogeneity shows that oe cannot 
be assumed constant. 

Variation in o,2 invalidates the F-test of the interactions m.s. against 
the pooled error m.s. Although the effect on the significance level of /’ 
is not known exactly, it operates so that use of the tabular F produces 
too many significant results. As in the previous section, the extent of 
the distortion in F can be seen in extreme cases. If one experiment has 
a much higher error variance than any of the others, /’ will be distributed 
approximately as the tabular / with (¢ — 1) and n’ degrees of freedom, 
where ¢ is the number of treatments and n’ is the number of error degrees 
of freedom in the experiment with the high error variance. In the ex- 
ample, if one of the experiments with 12 error d.f. happened to be much 
less accurate than the other experiments, 7 would have 4 and 12 d.f. in- 
stead of 16 and 60 as used in the test in section 14.13. The 5% level 
would be 3.26 instead of 1.81. Since this case appears to be the most un- 
favorable that would occur, we may conclude that the true significance 
level of F lies somewhere between these values. 

If the observed F in our data falls outside these limits, a definite con- 
clusion can be reached from the test without further knowledge of the 
exact significance level of 7. This happens in the example, where the 
observed F-ratio for the interactions m.s. is 3.96. However, even if the 
exact significance level were known, the test can be criticized because 
the F-ratio is no longer the most sensitive test criterion. Studies have 
shown that, with the amount of variation in o,” that appears typical of 
agricultural experimentation, this loss of sensitivity might be equivalent 
to discarding 10 to 20% of the data. An approximate test that avoids 
some of the loss in sensitivity will be presented in section 14.4. 

Thus far we have been discussing how the F-test of the interactions is 
affected by heterogeneity in the error variances. The F-test of treat- 
ments may now be considered. If the interactions are regarded as neg- 
ligible, so that the denominator of F is the pooled m.s. for interactions 
and error, the F-test of treatments is invalidated in about the same way 
as that of interactions. On the other hand, when interactions are 
present, and especially when they are large, the F-test of the treatments 
m.s. against the interactions m.s. is much less disturbed. The reason 
may be clearer if we consider the test of the linear component of treat- 
ments in table 14.5. The suggested denominator for this test was the 
linear X places component, 1473.. This is nearly 14 times the pooled 
errorm.s. It appears, therefore, that heterogeneity in the error variances 
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can influence only a small part of the interaction m.s., so that its effect 
is, as it were, greatly reduced. These remarks apply with much less force 
to the quadratic X places m.s., to which, taking the data at their face 
value, the pooled error contributes more than half. 


14.23 Summary. When all experiments are identical in structure, a 
combined analysis of variance can be computed relatively easily. Ap- 
plication of the ordinary tests of significance to this analysis is frequently 
open to question because of heterogeneity in the error and interaction 
variances. Despite this, it is advisable to draw preliminary conclusions 
as far as possible from this analysis, at least in the present state of our 
knowledge, because tests that are fully efficient and theoretically sound 
have not yet been discovered, and the approximate tests that have been 
devised to meet the criticisms are more laborious. 

When the interactions are sizable, there is usually little difficulty in 
interpreting the combined analysis. For, if the /-ratio for interactions 
is large, we may be confident that it is statistically significant in spite of 
some uncertainty about the exact significance level of F, and the crit- 
icism that the F-test is not fully sensitive carries less weight if F estab- 
lishes significance. Further, as we have seen, the F-test of the treat- 
ments m.s. against the interactions m.s. is little affected by inequality 
in the error variances when the interactions are large. The chief point 
to remember is that the interactions m.s. may itself be heterogeneous. 
Subdivision of the treatments and interactions m.s. according to the 
treatment comparisons that are of greatest importance is often 
useful. 

Another situation that presents little difficulty occurs when the aver- 
age differences among treatments are substantial, yet interactions are 
negligible. In this case it may be found that neither the /’ value for in- 
teractions nor that for treatments is close to the significance level. The 
cases that leave us in doubt are those where the F’ values are just slightly 


above the tabular significance levels. 


14.3 Experiments of Unequal Size 


44.31 Numerical Example. Provided that the same set of treatments 
appears in all experiments, & combined analysis can usually be con- 
structed even when the experiments differ in size and structure. Some 
issues arise, however, that are not encountered when all experiments are 
identical in design. The problems will be illustrated by the inclusion of 
the first experiment in the previous example. This was arranged in ran- 
domized blocks with 4 replications, whereas all other experiments were in 
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5 X 5 latin squares*, The treatment totals and numbers of replications 
are shown in table 14.6. 


TABLE 14.6 TREATMENT TOTALS (in 100 Ib.) 


Pounds Experiment Total No. of 
P205 1 2 3 4 5 6 plots 

0 456 710 900 850 650 850 | 4,416 29 

40 824 880 940 855 700 890 | 5,089 29 

80 924 | 1,005 | 1,035 940 750 935 | 5,589 29 

120 948 | 1,025 | 1,040 925 760 940 | 5,638 29 

160 1,008 | 1,085 | 1,085 945 7380 945 | 5,848 29 

Total 4,160 | 4,705 | 5,000 | 4,515 | 3,640 | 4,560 | 26,580 145 
No. of plots 20 25 25 25 25 25 


A combined analysis can be obtained by following the standard pro- 
cedure for data based on unequal numbers. ‘The square of any quantity 
is divided by the number of replications involved. The sums of squares 
are given below. 


(456)? +--+ (1008) i (710)? +--+ (045)? (26,580)? 


Total: = 131,925 

ae 4 5 145 Brak 
(4160)? (4705)? +-+-+ (4560)? (26,580)? 

Places: } = = 

laces, t % 15 55,509 
(4416)? +--++-+ (5848)? (26,580)? 

Treatments: ———_——— — —___— = 
‘reatment 20 15 45,613 

Treatments X places: 131,925 — 55,509 — 45,613 = 30,803 


The pooled error m.s. requires a little thought. If all experiments 
have the same true error variance o,”, the best procedure is to weight 
each error m.s., s, by the number of degrees of freedom n,. But if the 
experiments vary in accuracy, this weighted mean is a biased estimate 
of the component of error variance that enters into the interaction and 
treatments m.s. The correct component is > rio;?/>r; instead of 
> nioi?/ >) n;. Consequently, unless we are confident that o,2 is con- 
stant, it is best to weight the s; values by the numbers of replications. 
This gives 

_» _ 4(106) + 5(83 + 158 +---+ 52) 


8, = 
29 108 


*Since experiment 1 contained additional treatments (not discussed here), it pro- 
vided 21 error d.f. 
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: The F-test of the interactions m.s. against the error m.s. is carried out 
in the same way as with experiments of equal size, and is subject to the 
criticisms previously discussed. The F-test of the treatments m.s. 
against the interactions m.s. encounters a new difficulty that is due to 


TABLE 14.7 PRELIMINARY ANALYSIS OF VARIANCE FOR EXPERIMENTS 
OF UNEQUAL SIZE 


Source of variation df. 8.8. ms. 
Places 5 55,509 11,102 
Treatments 4 45,613 11,408 
Treatments X places 20 30,803 1,540 
Pooled error m.s. 81 108 


the unequal numbers of replications and is present even if all the as- 
sumptions required for the analysis of variance are satisfied. Under 
these assumptions, the expectations of the principal mean squares in 
table 14.7 are as given in table 14.8. 


TABLE 14.8 Expecrep VALUES OF MEAN SQUARES (WITH UNEQUAL NUMBERS OF 


REPLICATIONS) 
Source of variation Expected value of mean square 
ae: pes DiGi 
Treatments oe + Ao? + (D7) ENS ¢ 
Treatments X places oe + Feo," 
Pooled error oe 
where 
2 141 
A= Dri Ml _ 4 962 
Dr 
1 1 
7g = —— ; — 71) = = (29 — 4.862) = 4.828 
fait) (ni — fi) 5° ) 


tor is that the coefficient of o,” in the expected 


The complicating fac 
in the expected treatments X 


treatments m.s. is not the same as that 
places m.s.; in fact, it is always larger. The difference is very small in 
this example, as it is whenever the experiments do not vary much in 
numbers of replications. In addition, and for the same reason, the in- 
teractions m.s. is not distributed as a multiple of chi-square; that is, it 
does not have the type of distribution required for the validity of the 
F-test. Both factors tend to make the F-test give too many significant 


results. 
‘A crude method which at le 


astray is to adjust the F-ratio so as 
pectation. Let the expected values 0! 


ast prevents us from being led too far 
to remove the upward bias in ex- 
f the three mean squares be de- 
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noted by 6;, 8p, and 6., respectively. Then, if the null hypothesis is true 
(all 7; equal), it is seen from table 14.8 that the three expected values are 
connected by the equation 


0: = (k tn. 1)6 ma key 


where k = 7,/7. In other words, the test that we seck is a test that this 
relation holds. This suggests that 7 might be computed as 


Re s? + (k—1)8-2 11,403 + 0.007(108) 
sip? ~— (1.007)(1540) 


as compared with the original F-ratio of 11,403/1540, or 7.40. The sig- 
nificance level of F’ will also be altered slightly from that in the table. 

When interactions are likely to be large, and the chief interest centers 
in the test of the treatments m.s. against the interactions m.s., an al- 
ternative approach has much to commend it. This is to compute the 
analysis from the unweighted treatment means given in table 14.1, paying 
no attention to the numbers of replications. With this type of analysis 
the bias disappears, the expectations of the mean squares being shown 
in table 14.9, It will be noted that c,” and o,” carry the same coefficients 


F’ 7.35 


TABLE 14.9 Exprcrep VALUES OF MEAN SQUARES IN AN UNWHIGHTED 
ANALYSIS 


Source of variation Expected value of mean square 


: .— 7)? 
Treatments SE sete), XLG-7 
qT) 
Treatments X places aL oe 
fi 


in both expectations. The divisor 7, is the harmonic mean of the num- 
bers of replications, given by 7), = p/ >> (1/r). Moreover, if o,,” is much 
larger than ¢,”/7;, the interactions m.s. tends to be distributed as a 
multiple of chi-square, so that the conditions for the F-test are closely 
approximated. 

It is suggested that the pooled error m.s. for insertion in an analysis 
of this type be calculated from the formula 


4 1 3° 89” 857 
ee = (E42 4.4%) 
PAT T2 Tp 


By an extension of the results in table 14.9 to the case where the experi- 
ments have different error variances, this quantity is found to be an un- 
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biased estimate of the error component that enters into the mean squares 
for treatments and treatments X places. For the data in table 14.1 we 
have 


1/106 83+ 158 +---+ 52 
aft £ 
5. “(5 4F ; ) 22.4 


TABLE 14.10 UNW5EIGHTED ANALYSIS OF VARIANCE OF TREATMENT MEANS 


Source of variation df. 8.8. m.5. 
Places 5 11,692 2,338 
Treatments 4 10,554 2,638 
Treatments X places 20 7,158 358 
Pooled error 81 22.4 


Table 14.10, being in terms of treatment means, is in different units 
from table 14.7 (p. 403). As a conversion factor, the harmonic mean 7), 
which in this case is 4.8, may be used to multiply the mean squares above 
for comparison with those in table 14.7, though this conversion is not 
needed for our purpose. 

In a choice between a weighted and an unweighted analysis, the fol- 
lowing are the relevant considerations. The statements below are 
strictly true only if all experiments have the same error variance per 
observation (or plot), though they should remain substantially true with 
a moderate variation in o;°. The weighted analysis is superior for the 
F-test of the interactions. It gives a more powerful test, and the F-ratio 
approximates the tabular distribution of F more closely. It is also 
superior for the F-test of treatments if interactions are non-existent or 
small, since in this case the bias in the test is negligible. The unweighted 
analysis is preferable for the F-test of treatments when the interactions 
are not negligible. As it happens, the results in the example are not too 
well in accord with these statements. For the interactions test, the /’- 
ratio is 14.26 with the weighted analysis and 15.98 with the unweighted 
analysis. The explanation of the higher value with the unweighted 
analysis is probably that experiment 1, whose responses differed from 
those in the other experiments, had the smallest number of replications. 
The values of F in the test of treatments were practically identical, being 
7.35 (after adjustment) in the weighted analysis and 7.37 in the un- 


weighted analysis. 
Series of Lattice Experiments. In 


was pointed out that if inter-block 
of treatments in an individual ex- 


14.32 The Combined Analysis for a 
the discussion of lattice experiments i 
information is recovered, the F-test 
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periment is not exact, because the relative weights attached to inter- and 
intra-block estimates are subject to sampling errors. For the same rea- 
son any combined analysis must also be approximative rather than exact. 

The designs need not be identical at all places. With 25 treatments, 
for example, some experiments might be simple lattices in 4 replications, 
while others are lattice squares in 3 replications. Also, inter-block in- 
formation need not be recovered in all analyses; at some places we might 
have used a randomized blocks analysis. 

If the experiments have different numbers of replications, the pre- 
vious section has indicated that there is a question whether to use a 
weighted or an unweighted analysis. The former is preferable when the 
chief purpose is to test the interactions with places, and when such in- 
teractions are negligible; the latter when interactions are sizable and the 
principal interest is in a test of the average effects of treatments. With 
lattice experiments as used in agriculture, we are often interested in both 
interactions and average effects. But as interactions are seldom absent, 
the unweighted analysis is frequently advisable, and has the virtue of 
being slightly simpler. 

The weighted analysis will be described first. First form a two-way 
“treatments X places” table similar to table 14.6. The entries in the 
table are the adjusted treatment totals, except in experiments analyzed 
by randomized blocks, where unadjusted totals are used. This table is 
analyzed into components for 


Places 
Treatments 
Treatments X places 


If the numbers of replications 7; differ, the weighted analysis follows the 
same procedure as in table 14.6 (p. 402). The square of every marginal 
treatment total is divided by > r; in order to reduce the analysis of vari- 
ance to a single-observation basis. The square of a place total, summed 
over all treatments, is divided by ér;, where ¢ is the number of treatments. 
The only additional component is the pooled error m.s. In any ex- 
periment where inter-block information has been recovered, we use as 
the estimate of the error variance per observation (or plot) the effective 
error m.s. Z;’, This quantity is obtained by adjusting the intra-block 
m.8, upwards go as to allow for sampling errors in the block corrections, 
and its calculation is explained along with the analysis for each type of 
lattice. For the triple lattice, for example, Z/ is eft + el 
there are any experiments in which the intra-block analysis has been 
carried out, Z,/ is given the value which it takes when Hy becomes very 


—— a 
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large relative to Z;. For experiments analyzed by randomized blocks, 
E,/ is the ordinary error m.s. 

On the assumption that the true effective variances are unlikely to be 
the same in all experiments, the pooled error m.s. is calculated as 
> rBi/ > 7;. When 7; is constant, this reduces to the unweighted 
mean of the H;’., Tests of significance are made as in the numerical ex- 
ample, and are subject tothe same general criticisms. Since the larger 
lattice designs provide ample degrees of freedom for error, the pooled 
error may be relatively well determined even if some experiments are 
much more accurate than others, so that there is less uncertainty about 
the true significance levels of the F-ratios. 

For an unweighted analysis with experiments of unequal size, con- 
struct a two-way table of adjusted treatment means. This is analyzed 
by the simple standard procedure. The comparable pooled error m.s. is 


taken as 
1/E, . E,! E,’ 
ei (M y+) 


" T2 ? 


With balanced designs, the method given above is a natural extension 
of the technique used to obtain an approximate /’-test of treatments in 
an individual experiment. With partially balanced designs the method 
is more crude than that used in single experiments, where a special sup- 
plementary calculation was made for the F-test. This calculation can be 
extended to apply to a combined analysis, but it is doubtful whether the 
elaboration is worth while. 


14.4 A Test of the Treatments X Places Interactions 


In this section we give an approximate test of the treatments X places 
interactions for series of experiments where there is considerable varia- 
tion in the experimental error variances. It was suggested that the F- 
test, despite its imperfections, will often serve our purpose. The present 
test differs from the F-test in that it gives less weight to experiments 
which have a high error variance, and consequently may be expected to 
be more sensitive. It may be useful in cases where there is doubt about 
the verdict given by the F-test and in cases where the most efficient 
analysis is desired. 

As before, let 2;; be the mean of the jth treatment, and o;” be the true 
error variance per observation in the 7th experiment. We assume that 
all experiments have the same design, and let s;° be the error m.s. per ob- 
servation in the ith experiment, based on n degrees of freedom. 

For known values of 2 the theory of least squares indicates that the 
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mean 2;; should receive a weight W; = r/c;7. In a weighted analysis of 
variance of the treatment means, the interaction s.s. is known to be 
distributed as chi-square with (p — 1)(¢ — 1) degrees of freedom. This 
sum. of squares appears to be the best test criterion available, unless we 
possess specific information about the type of interaction that may exist, 
in which event a more specialized criterion would be constructed. 

In default of knowledge of the o;*, the natural step is to consider a 
weighted analysis with weights w; = r/s;*. That is, we weight each 
mean inversely as its estimated variance, since we do not know its true 
variance. This is satisfactory, provided that the s,? are good estimates 
of the «2. Upon examination it appears that the s,7 should be based on 
at least 15 d.f. To illustrate the calculations, the test will be applied to 
the data from the last 5 experiments, though the number of degrees of 
freedom, 12, is slightly too low to use the test with full confidence. 


TABLE 14.11 Trearmenr MEAN yreELDs (100 LB. PER ACRE) FOR A WEIGHTED 
ANALYSIS OF VARIANCE 


Pounds Experiment Weighted total 
P:05 2 3 4 5 6 Di rwstig = Tj 
10) 142 180 170 130 170 53.570 
40 176 188 171 140 178 57.000 
80 201 207 188 150 187 62.194 
120 205 208 185 152 188 62.264 
160 217 217 189 156 189 63.932 
W; 0.060 0.032 0.116 0.025 0.096 0.329 = W 
Total (P,) 941 1000 903 728 912 
w,Pi 56.460 82.000 | 104.748 18.200 87.552 | 298.960 = @ 
8.8, (Si) 180,655 | 200,946 | 163,431 | 106,440 | 166,618 


The arrangement of the data in table 14.11 should be noted. The treat~ 
ment means and s,? values come from table 14.1 (p. 393). The weights 
for individual entries in the table are placed in the row immediately be- 
low the entries; for example, w, = 5/83 = 0.060. The total of the 
weights is denoted by W. 

1. Form the column totals P; and the products w;P; Form the 
weighted row totals. The corner total, @ = 298.960, supplies a check 
on both the column and row totals. 

2. The items in the analysis of variance are obtained as follows. 

Correction term: 


C = — = ——_—_~. = 54,832.57 
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Total: First compute and record in the table the sum of squares S; 
(uncorrected) of the entries in each column. The total s.s. is then found 
as 
DY (wiS) — C = (0.060) (180,655) +--+ ++ (0.096) (166,618) — C = 

551.33 

Places: A column (place) total has variance to;”/r, and hence receives 

a weight w;/t. The sum of squares is 
4D (wiP?) — C = 230.09 
This is found readily by using the products w;P;. 

Treatments: Each treatment total has estimated weight W. The sum 

of squares is 
eur? a (53.570)? -+ +--+ (63.982)? 
Ww 0.329 

Treatments X places: Since this is the only component in which we are 
currently interested, it is unfortunate that it must be found by means of 
the others. As usual, it is given by 


551.33 — 230.09 — 229.57 = 91.67 = I 


C = 229.57 


The sum of squares J does not follow a chi-square distribution, being 
inflated by errors in the weights. It can be reduced to a quantity that is 
distributed approximately as chi-square.* We take 


(n — 4)(n — 2) i wath (p—1I)t-D)a-4) an 


2 
n(n +t — 3) (n+t— 3) 


x 


In this case 


ee af a UB) _ 
uy (12)(14) GLO) i> 68:00) WHE a = 9.14 df. 


In this approximation the degrees of freedom ascribed to chi-square are 
not integral. Significance levels are obtained by linear interpolation in 
the tables of chi-square. Since the 1% levels of chi-square are 21.67 and 
23.21 for 9 and 10 d.f., respectively, I is obviously significant. 

The test can also be applied to any component of the interactions 8.8. 
As an example, we will test the interaction of the quadratic component 
of the regression on amount of phosphorus. This was previously tested 
by the F-test in table 14.5, and found non-significant. The values given 


x 


* This test is an extension and modification of a test previously given by Cochran 
(14.6). 
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for the quadratic components in section 14.21 were derived from treat- 
ment totals; to obtain corresponding values from means, we divide by 5. 


Experiment 
2 3 4 5 6 Total 
Qa —65 —16 —14 —20 —22 
wiQ: | — 3.900 | — 0.512 | — 1.624 | — 0.500 | — 2.112 | —8.648 


Each quantity has variance 14 o7/r, and receives a weight w;/14. The 
sum of squares is 


1 (2 ay 1 
— iQ? — = — (340.89 — 227.32) = 8.11 =I 
2 (Swe SEO”) 3 


The quantity I, is the sum of squares for the interaction of a single 
treatment comparison with all p places, and would normally have 
(p — 1) degrees of freedom. To present the general formula for con- 
version to chi-square, suppose that we have computed the sum of squares 
for the interaction of ¢, treatment comparisons with p; of the places, so 
that there would normally be t,(p; — 1) degrees of freedom. Then 


2 (n —-4)(n — 2) ee jy Pa 1)(n — 4) at. 

n(n + t, — 2) (n+ t — 2) 
This formula is in accord with that used for the whole (¢ — 1)(p — 1) 
degrees of freedom in the interactions. Since the ¢ treatments provide 
(t — 1) independent treatment comparisons, we would take t; = (t — 1) 
in applying the formula to the complete interactions s.s. For 4 = 1, 
p= 5, 


2 


eae (8)(10) (8.11) = 4.92, with M6) 
(12) (11) (11) 


The tables show that this value has a probability between 0.2 and 0.1, 
since by interpolation the 20% level for 2.91 df. is 4.51, the 10% level 
6.10. The earlier F-test gave a probability slightly over 0.2, so that the 
two tests are in close agreement. 

If the experiments differ in size, the weight w; becomes r;/s;°. The 
numbers of error degrees of freedom will also vary. This necessitates a 
more elaborate conversion formula: as a rough approximation the aver- 
age number of error degrees of freedom per experiment is used in place 


= 2.91 df. 


x 


of n. 
The test is not recommended for low values of n, because the s,? are 
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relatively poor estimates of the «7, and frequently one or two experi- 
ments receive such high weights that they dominate the analysis. Ap- 
propriate methods are presented by Yates and Cochran (14.5) and 
Cochran (14.6). 


14.5 Repetitions in Both Space and Time 


As has been mentioned, agricultural field experiments are often re- 
peated both at a number of places and for a number of years. We will 
give only an introduction to the simplest case, in which experiments have 
been carried out at each of p places for y years. At any place a new site 
is chosen each year for the experiment, and a new randomization em- 
ployed, so that the data from successive years may be regarded as 
independent. The experimental arrangement need not be uniform 
throughout the whole series of experiments: in fact, a change in design 
between seasons is not uncommon. 

The mathematical representation of the mean of the jth treatment at 
the ith place in the kth year is now more lengthy. In addition to the 
symbols previously used for place and treatment effects, Yi will denote 
the effect of the year. Interactions are denoted by multiple symbols: 
thus (x7); is the contribution of the place X treatment interaction in 
this experiment. With this notation 


aie = ebm te + os + (wy)ix + (ry)in + Cerise 1 sik 


Note that there are treatment X place, treatment X year, and treat- 
ment X place X year interactions. 

In experimental programs of this type, it is usually hoped that the 
places and years constitute a representative sample of the population of 
places and years to which the results will be applied, There are obvious 
practical difficulties in choosing places and years that can be confidently 
asserted to be such a representative sample, and sometimes little effort 
is made to ensure that this will be so. The hard fact is that any sta- 
tistical inferences drawn from an analysis of the data will apply only to 
the population (if one exists) of which the experiments are a random 
sample. If this population is vague and unreal, the analysis is likely to 
be a waste of time, at least from the strictly practical point of view. 

It seems appropriate to regard all the quantities in the equation as 
random variables except the general mean and the true effects 7; of the 
treatments, because if we could tabulate all the values of say (a7), in 
the population, they would follow some frequency distribution from 
which the values in our data are a sample. For the full application of 
the analysis of variance we require the assumptions that the experi- 
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mental errors and all interactions of treatments are normally and in- 
dependently distributed, with variances that are denoted by use of the 
letters that enter into the interaction. 

An unweighted analysis of variance of the treatment means is ob- 
tained by the standard procedure for factorial experiments. The im- 
portant parts of his analysis are sketched in table 14.12. 


TABLE 14.12 ANALYSIS OF VARIANCE OF TREATMENT MEANS WITH TIME AND 
PLACE VARIATIONS 


Source of 
variation df. Expectation of mean square 


Dineen 


Treatments ¢t—) od + otpy? + poty® + you? + py Ra) 


Treatments 
x places (— 1) -) oe? + atm? + yop” 


Treatments 
X years ¢=HD¢—D oe + otp? + pow? 


Treatments 
X places 
Xyears  (¢—I(p—I(y—1) ae? + py? 


Pooled error oe 


In the most general case, o.” is the average value of o;7/riz, Where 
oi is the error variance in the individual experiment at the 7th place in 
the kth year, and 7x is the number of replications in that experiment. 
Consequently, the pooled error should be estimated as the average of 
six’/' Vike 

The method to be followed in testing the significance of the successive 
terms is made clear by the expected values of the mean square. The 
treatments X places X years interactions are tested against the pooled 
error. Both treatments X years and treatments X places are tested 
against treatments X places X years. 

‘The situation with regard to the test for treatments is interesting. It 
is evident that no other mean square in the analysis is suitable as a de- 
nominator of F, unless either the treatments X places or the treat- 
ments X years interactions appear to be negligible, so that the corre- 
sponding variance is assumed to be zero. If this happens, the other 
interaction m.s. is an appropriate denominator for F (subject to some 
uncertainty in case the assumption should not be correct). 

When both two-factor interactions are present, the hypothesis that 
all 7; are zero is equivalent to the hypothesis that 


0: + Oipy = Sp + Oty 
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where the @’s stand for the expected mean squares in table 14.12. No 
exact test for this kind of relationship is at present known. By analogy 
with the F-test, one suggestion is to use the ratio 


2 
Set Sipy? 
2 3 
Stp ate 8147 


where the s? values are the mean squares in the analysis of variance. 
This ratio does not follow the F-distribution, but following an approxi- 
mation suggested by Satterthwaite (14.7) and others, we might use the 
F-tables with ny’ and no’ degrees of freedom, where 


(62 + 8p??? 


ra 4 
St Stay 


ny! 


TM = Mpy 


where the n’s are the numbers of degrees of freedom in the corresponding 
mean squares. The analogous expression is used for na’. 

After completion of the analysis, the next step is to examine the nature 
of the interactions and average effects of the treatments, bringing to bear 
any external knowledge that will throw light on the interpretation. 
Often, for the practical uses of the results, we would like to decide 
whether a single recommendation can be made for application in the 
whole population, or whether, on the other hand, it is necessary to have 
several recommendations for different parts of the population. These 
decisions are facilitated by calculating confidence limits for the dif- 
ferences among the means of the most likely candidates. For a discus- 
sion of methods for making this calculation, which can give rise to some 
complications, see reference (14.5). 
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CHAPTER 15 


RANDOM PERMUTATIONS OF 9 AND 16 NUMBERS 


15.1 Use of the Random Permutations 


In practically all the experimental designs, the randomization con- 
sists in arranging a set of objects, whether treatments, blocks, rows, or 
columns, in random order. If the number of objects does not exceed 9, 
a randomization is obtained at once from table 15.6, which contains 
1000 random arrangements of the numbers from 1 to 9. For example, 
to arrange 7 treatments in random order, select a starting place in table 
15.6 (without inspection of the numbers in the table). Suppose that the 
permutation chosen is 


15859) 2 60,7 5 8) 4 
Omitting the digits 8 and 9, we obtain 
ee Oe be 3. 4 


for the desired arrangement. Similarly, table 15.7, which contains 1000 
permutations of the numbers between 1 and 16, may be used for any 
number of objects between 2 and 16. 

Occasionally the randomization involves dividing the objects into 
groups, as in the completely randomized and cross-over designs. Thus, 
with a cross-over design (section 4.4) having 2 treatments and 10 blocks, 
treatment A must appear in the first row in five of the blocks, chosen 
at random, and in the second row in the remaining five blocks. Select a 
random permutation of the numbers up to 16 from table 15.7, say 


6) 16 AIO RG 12s FOS. Ti 21 ares ie si co: 
The numbers from 11 to 16 are ignored. The first five of the remaining 
numbers, 6, 10, 5, 7, and 9, are chosen as the blocks in which A appears 


in the first row. 
If the number of objects to be randomized exceeds 16, see section 15.3. 


15.2 Construction of the Random Permutations 


The idea of using random permutations instead of the ordinary ran- 
dom digits was suggested by Professor George W. Snedecor. Before the 
414 
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writing of this book, 200 random permutations of each of the numbers 
from 3 to 13 had been made at the Statistical Laboratory at Iowa State 
College. These were obtained by first constructing a large number of 
ordinary random digits, which were then transformed into random per- 
mutations. The method of transformation was the usual one: it may be 
illustrated for the numbers from 1 to 11. To obtain a random per- 
mutation of the numbers 1 to 11, a series of pairs of random digits is 
taken, say 
65, 04, 29, 82, 37, 52, 53, ete; 


When we divide each number by 11 the remainders are 
10, 4, 7, 5, 4 (omitted), 8, 9, ete. 


This process is continued until 10 of the 11 numbers have appeared, 
repetitions being omitted. 

Rather than present 200 permutations of each of a series of numbers, 
it was decided to concentrate on two numbers. This decision was made 
partly because any permutation of 9 numbers, say, can be used to give 
permutations of all numbers less than 9. Thus 1000 permutations of 9 
numbers are more useful than 200 of each of the numbers 9, 8, 7, 6, and 
5. Also, certain numbers, such as 11 and 13, are less often used in ex- 
periments than others. The numbers 9 and 16 were selected for presen- 
tation because of their occurrence in factorial experiments. 

The 1000 permutations of 9 were obtained from the permutations 
already available for 9, 10, 11, 12, and 13, by omitting the numbers 
above 9. Of the permutations of 16, 800 were obtained from random 
digits. With as many as 16 numbers to permute, the method of division 
as illustrated above is rather slow. A more rapid alternative, suggested 
by Mr. Paul Peach, was used. In this method 16 pairs of random digits 
are taken, for example, 


75 10 17 28 91 85 74 56 71 06 26 O1 06 21 O07 60 
14 5. cel O16 1b ee 12 Seon oleae 


The permutation is produced by ranking the pairs in order of size, as 
shown below each pair. In the event of a tie, as happens in this example 
with the two 06’s, the two ranks in question may be allotted from an ad- 
ditional random digit. Ties occur frequently (the probability that all 
16 pairs are distinct is about .35) but cause little delay. Although the 
method has the additional merit of using fewer random digits than the 
method of division, with 32 random digits per permutation a large supply 
of random digits is required. The remaining 200 permutations were ob- 
tained by drawing numbered marbles from an urn. 


416 RANDOM PERMUTATIONS OF 9 AND 16 NUMBERS 15.3 


15.3 Randomization of More than 16 Numbers 


The most suitable method depends on the facilities available and on 
the frequency with which randomizations have to be made. Navy beans, 
drawn from a box, have been found quite expeditious and convenient, 
though the numbers on the beans rub off with repeated usage. Al- 
ternatively one of the standard sets of random digits, references (15.1— 
15.4), may be used. For example, with 25 numbers to be placed in ran- 
dom order, choose a starting place in the table and select the next 25 
sets of three-digit random numbers. These numbers are then placed in 
inereasing order by the method used in the previous section for the per- 
mutations of 16. Three-digit random numbers are taken instead of two- 
digit numbers in order to avoid ties. With three-digit numbers the fre- 
quency of ties is negligible even if as many as 100 numbers are to be 
randomized. If punched card machines are available, the numbers 1 to 
25 may be punched in columns 1 and 2 and the random digits in col- 
umns 3, 4, and 5 of the card. The permutation may then be produced 
and printed by sorting on columns 3 to 5. By filling all the columns from 
8 onwards with random digits, a number of permutations can be obtained 
by sorting on different sets of columns, though the number is limited if 
the permutations are to be kept independent of each other. 


15.4 Tests of Randomness 


Many tests of different aspects of the randomness of the permutations 
could be made. $o far as their use in experimental design is concerned, 
the test most immediately relevant is a test of the null hypothesis that 


TABLW 15.1 NomsBer OF OCCURRENCES oF 1, 2, +", 9 IN Ist, 2ND, +++, 
Qu PostrioN 1N 1000 PERMUTATIONS OF 9 
Position 
Number] 5. [gy Bs doy Gin ,6) den Se Oe [oes 


130 111 111 108 94 121 1i4 106 105 | 1000 
100 118 104 102 102 133 98 108 4115 | 1000 
92 100 116 111 113 110 116 123 119 | 1000 
11 98 97 108 117 112 124 102 121 | 1000 
103 120 140 122 4 113 113 110 | 1000 
107 118 118 88 113 120 124 111 101 | 1000 
108 119 121 124 110 94 95 113 116 | 1000 
15 «116 «110. 98 107 109 4117 121 107 | 1000 
112 117 103 121 122 117 99 103 106 | 1000 


CONAnRWONeE 
S 
on 


‘rotals | 1000 1000 1000 1000 1000 1000 1000 1000 1000 | 1000 
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the numbers 1, 2, ---, 9 have an equal probability of appearing in the 
lst, 2nd, --:, 9th positions. This test is made by counting the number 
of times that each number has occurred in each position. The results 
are shown in two-way presentations in tables 15.1 and 15.2. 

For the permutations of 9, the expectation in each cell is 1000/9, or 
111.1. Each row and each column in table 15.1 adds to 1000. Taking 
the values in a single column, we may test by x” whether the 9 numbers 
are equally represented in the corresponding position. If the a’s are the 
observed entries in the column, 


z—my 9 1000\? 
vos m : a 2 9 ) 
For computational purposes, this simplifies to 
x? = roo dX (x) — 1000 
Thus, for the first position, 
x? = zp8yg(130? + 120? +---+4+ 112?) — 1000 = 10.81 


with 8 df. Similarly, from the rows we may test whether any given 
number has appeared equally often, apart from sampling fluctuations, 
in each position. Although the two sets of tests are not independent, 
both are of interest. The values of x? are shown in table 15.3. 


TABLE 15.3 Vatues of x? From 1000 PERMUTATIONS OF 9 


Position Number | 
test x? test x | 
1 10.81 1 7.46 ! 
2 5.19 2 9.17 

3 5.08 3 6.70 
4 17.16 4 7.75 | 

5 6.06 5 18.90 

6 15.52 6 8.97 

xe 8.32 7 8.25 

8 3.38 8 3.45 

9 3.44 9 5.32 
Totals 75.96 75.97 | 
{ 
| 


In the position tests, the individual probability values range from 
90 (x2 = 3.44) to 028 (x? = 17.16). With 8 df., the probability that 
the Jargest of 9 independent values of x? should exceed 17.16 works out 
at .23. Actually, owing to the fact that the cell numbers add to 1000 
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in any row or column, the 9 values for the position x” have a small pos- 
itive correlation which reduces the probability value for the largest x? 
to about .20. The individual probability values for the number tests all 
lie between .90 (x? = 3.45) and .015 (x2 = 18.90). The probability that 
the largest x” should exceed 18.90 is about 0.11. Thus neither series of 
tests indicates any marked departure from randomness. 

The total, 75.97, which is >} (« — m)?/m taken over all cells in table 
15.1, supplies a composite test of the complete table. On account of 
the positive correlation referred to in the previous paragraph, the value 
75.97 does not itself follow a x? distribution. In fact, it may be shown 
to be distributed approximately as (9/8)x?, where x” has 64 d.f. Con- 


8) (75. 
sequently we may test SAE), or 67.52, as a x” value with 64 df. 


As would be expected from the results of the previous tests, the prob- 
ability value, .36, provides no evidence for the rejection of the null 


hypothesis. 
The corresponding x” values for permutations of 16 are given in table 


15.4. 


TABLE 15.4 Vauuns or x” rrom 1000 PERMUTATIONS OF 16 


Position Number 
test Mal test x2 
1 14.40 1 20.25 
2 17.70 2 12.96 
3 11.87 3 16.58 
4 15.14 4 4.80 
5 17.70 5 14.94 
6 7.42 6 9.57 
ve 12.29 7 13.98 
8 7.42 8 13.63 
9 24.54 9 15.55 
10 9.15 10 12.45 
il 10.62 ll 14.05 
12 16.51 12 18.56 
13 9.92 13 14.50 
14 18.85 14 13.06 
15 6.98 15 14.78 
16 19.36 16 10.21 
Totals 219.87 219.87 


The only figure in the group which has an unusual probability is the 
low value 4.80 for number 4. This has an individual probability of .9932 
(15 d.f.). When account is taken of the fact that this is the smallest of 
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the 16 values for the number tests, the probability is found to be .10. 
The total, 219.87, is distributed as (16/15) x? with 225 d.f. and is found 
to be slightly but not abnormally below its expectation. 

‘A further test of randomness was made by counting the number of 
inversions for each permutation. The inversions are counted as follows. 
For each number, record how many numbers to the left are greater than 
the number, Thus for the permutation 


Sindee SalsD puleay Oe. 


we record 
000 8 02 5 3 4 


The total (3 +2+5+3-+4 = 17) gives the number of inversions. 
The extreme values for the number of inversions are zero (for the per- 
mutation 1 2345678 9) and 36 (for the permutation 98765432 1). 


TABLE 15.5 OBSERVED AND THEORETICAL FREQUENCIES OF NUMBERS OF 
INVERSIONS FOR PERMUTATIONS OF 9 


Number of Observed Theoretical | Goodness 


inversions frequency frequency of fit, x” 
0-4 8 6.3 0.46 
7 4 6.0 0.67 
8 ll 9.9 0.12 
9 10 15.3 1.84 
10 13 22.1 3.75 
ll 36 30.4 1.03 
12 38 39.7 0.07 
13 50 49.5 0.01 
14 71 59.1 2.40 
15 65 67.8 0.12 
16 80 74.6 0.39 
17 7 79.0 0.81 
18 90 80.6 1.10 
19 75 79.0 0.20 
20 80 74.6 0.39 
21 57 67.8 1.72 
22 58 59.1 0.02 
23 45 49.5 0.41 
24 40 39.7 0.00 
25 27 30.4 0.38 
26 27 22.1 1.09 
27 18 15.3 0.48 
28 7 9.9 0.85 
29 il 6.0 4.17 
30-36 8 6.3 0.46 


in| 
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In intermediate cases the number of inversions serves as a measure of 
the extent to which the order is inverted from the natural order towards 
the reverse order. The frequency distribution of the number of in- 
versions has been studied by Rosander (15.5) and Kendall (15.6), who 
derived from the number a measure of the rank correlation between two 
series of figures, For random permutations of 1 to 10 numbers, Kendall 
also tabulated the exact distribution of the number of inversions. His 
distribution was extended to 16 figures for the test given here. 

For permutations of 9 the observed and theoretical distributions are 
shown in table 15.5. 

The total x, 22.94, has 24 d.f. and agrees closely with expectation. 
The corresponding value for permutations of 16 is x? = 47.52 with 54 
d.f. and a probability value of .72. 
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15.5 Tables of Random Permutations 


TABLE 15.6 Permurations or 9 
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TABLE 15.6 Permutations or 9 (Continued) 
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HAM ArOMND 
MAM MON dT OO 
RHONND MOF 
AntoNnnwarn 


MOrOHTHANAW 
DArRrHHtMADOW 
MHOND A Orc 
ATerMnonorn 
AMOAOrHAWMDH OD 


RMODNHAHON 
HWAMDDONAOE 
OATEH MONO 
Ah O19 9 HOMO 
OMAANM 19 0 dH 


ADDROW M ING 
AW OOW AIOE 
WOAH ADH ON 
OnAHtH MADINA 
DHArMAnondina 


AM MOMOACHNH 
ONAN HDA 
ADMWHHM OHO 
AMHORBON HH 
MOMOMHNHAD 


ANAM MDH OOH 
HO MOnRATAAD 
AotnhOnOo HOD 
HN OnNNAM HO 
rORANDOWND 


RAR AOMDHONH 
MADHtHOIANAr 15 
TAN DOrROADMHN0 
ANCOnAHO 1D 
DOMmnnantano 


RDWOMAAWW 
CONatOnws 
RMDOTAADN 
HAMM OHOnN 
THRARMOMNAAN 


RAanonnton 
AwNDotorn a 
DOonmnrnodr 
MH OHOnMAS ID 
rAMHATONAW 


MAnwmonrvodtrio 
OnMtr ADAG 
CONAMMAH HOD 
ANDMaAnnrniod 
AtnROnoOMDH 


AM OIMHANOND 
CARP MANKNDCD 
ROANtHNAOCAM 
Ar OMANHDORMO 
AANMDMDOHAN OD 


ANHAOnMMNDG 
IN HA AAD © 
MOmMINNT A OH 
AOAMWNM tr 
MOAMtHHAAnNOwW 


RNHANWDOHTN 
Ornmnntonrn 
Mr MMDAAHANO 
RiINAntMoaworn 
Orrtnonan 


HAMM OOnN 
AMOrOHMAND 
Honora onn 
MANHDNONDA 
mt mMMonNonan 


ONTRMAMAE 9 
Oraravotanmnaw 
HFODHANMANM 
FOAM HHOOD 
HOMMNOHAAY 


RNANHDOAND 
Qonntononrn 
ANDOMrHMOAH 
DArRHANAMDOMHA 
AMDONrmINAH 


MOMORHHHN 
MOMATND IND 
HANH ID On DO 
NAnwmDanmnnmrdto 
HO MHAMOND 


CORMABDHAM H 
AMHMODH AW 
ONMOHOHHN AH 
Ordoannin 
HH OMINRONH 


AWDOMINN AHH 
MHORHNH AOI 
OrmroOtAADAH 
AwMOCHM OMA 
HOMANAHMOM 


ID AIAN DO OH OOH 
OMMOMDrRAAH tH 
CHANHINM ADM OO 
WARABOHNODE 
NDteMMAOD 
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28363 47821 48413 14437 79287 
2 
8 
7 


mNnOrRAoOtAN 
RNHMANOCAr 


DANN HoOnor 
ADMAHOnR DH 
COmrKRMNANHMD 
BRADMAN HON 
HONRMAGDOAM 


MONT OMS 
RtOAMon 
ADrHNManod 
DOM MATA 
OANAHTNOANMA 


RPMODOWAANA 
mMOAmmaxdtonna 
HTRDRAONAON 
ArRWNAWDOtHA MY 
ACrHNDMANoH 


AAOntnronn 
ADOnHNM 1D 
MO AMDAOCHEH 
aAMHMOrODAN 
RHDARAMNS 


Mr AMHOAAH 
MNKmAWAHAOH 
HMHOANOSOHMMD 
BRAMMADAHAS 
DMNHAnnrorn 


Aton ONnaMaAw 
Nonnmotnonh 
HOANNNDNMANL 
HSA Mrowmwnana 
HAMDAOVNHT 


AMON ADAANM 
AAMHOrMANS 
RNDAMANAOW 
RHA DMNATAW 
HaAaMAMONND 


ROTO NBDAN 
OID RD NOW H 
mAADKROMMOA 
MM AOKATS 
DHOMNARY 


HAMAD OAW 
AONNrOM HO 
CHANT NMD 
DAnnAaraneo 
NHAOWONMDE 


HAM rHAADO HO 
WMOMOONnR 
AROTAMNWOS 
ARTMOAMAD 
HMORMATE 


HOMAhH AON ~ 
DANtKHMAAON 
Aowdnhnoton 
ArMnMnonte 
AW anmonton 


25444171151 


MIDOn ye AHN 
ADAMMHHASO 
HMMHANnOND 
ROOMAADIW 
CANOWNONHMO 


ROMNANTDSH 
INM— MA OOMAD 
AnwatonmAMID 
wHOHROAANH 
MINOR AAO 


IMO AnMWAT HOD 
DAHHNADOWH NONE 
RHA OHANDNM 
HTHROMNAAAD 
RHNOMHAND 


SAAN DOMnrH 
H+OWMRANMADH 
SNotHwmnwso 
MNHHAMAD AD 
CMH MOABDOrMH 


WMAnrOMtMON 
AW OCNDHRHOMEM 
AMAR DTNAGO 
AOnmortnn on 
RPOHNMANDGD 


mNOAOMOHION 
ONrMWMA OM OO 
Onvownrdn ot 
mINAtoMmAanNA 
CaOtnnmomnnn 


THONMNADMAW 
HNOMDABWODY 
CHOMAKHAM O 
ANAOWT ONO 
RAWDAMHON 


ARMA HAAM OS 
HOR MAT ANS 
BOORMAN MDMHD 
HH OM ADM 0 
AMDMONMAWE 


BHOANrRDMWH 
CHOMMrEAHA 
MORNIN AOMO H 
AovnoanatE 
RHOMARWDOAH 


AmMmMNHoOtnr 
DMR AAMOWA 
Anenmtorno 
mheMAtHNAOA 
FONMNAATN 


MOK ANDOMS 
AtHHDRADH DH 
wAanANDtOMhy 
SNM MHwoOond 
DOWANAHNKRHO 


CiNNMeOtnan 
HOHOrNANM 
RHNDATOWA 
Mr AoOMOANH H 
MAKhHHHDOMAR 


MAM ertoon 
MHoHNMON Dw 
DAN ROAHO 0 
mAOMAoHtonanw 
RONT HOMO H 
CAnAWNAN OD HH 
CNHAHHDMAD 
QtH MM OINNS 
RAM HOMNDAD 
mr ANM MO dH OO 


TANOPAHOHDM 
HANARORM DHA W 
ONMM AT +H 19 1 
ROMNMAOHTH 
WDAOMrMOWA 


DMOMAKHASH 
SODMNHhRAH 
OMHANAAnNON 
ArwNatnnraoo 
MAtTANHADMS 


MAAORN OD HO 
MOTMrANDN 
HACMrAOMANA 
RWDOHONMWMAMD 
MrHAARDOAG 
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AhRowmd 9 OOH 
HMOONdthr OHS 
ADK wMAMHOD 
SOonntmor9 
OnNnrtnHr ood 


BRMMDMNAANOW 
CMWATKHDHAOD 
SOnANDOMN OH 
Omtrornnonw 
COAT ORNH 10 


OAWMOANMOAN 
MOOnDHANNM 
MOHAN DHAND 
OMNANHHNOD 
ARMHorowm + 
HMOHNANDMSO 
AtHMeOrnDoM 
DAotarnwon 


HOH AMANO 
RMOHHAMRAOH 
OnrMHtNoAAN 
AMM OHDAHHW 
POMONA A +O 
ARHDHnMHOW 
OMMANADHE 
HHOn MODAN 
MODIDHAAKHN 
OFNDDAHATHOD 
RADHNOMHOF 
Hon MNOrRa oO 
AMontNdAn oO 
HONNMoWon 
ANAtDoOMnN 
OAHKROCANWN 


OtrHoNnwmonwn 


DWH MOMWAAHA 
HRAONNRTONM 
DARAHTAAOHH 
AodrMotanE 
KRAnnntmon 


ROAODATAN HH 
Aonrnnatnn 
ADHAaMNMOoOnNd 
ROMNAHAMAH 
OMIM ry ANS HOO 


DONNMHAND 
HONMD AHMAD 
AHONADMOMINN 
MOE AMHANOA 
AArDANowWn 


HADnNnatneo 
DAMM MAHAOS 
MOrarti oo 
ARrROHMIN OH 
MONDMAAHE 


WHIM AMDOAM tH 
ROADHOAHIO 
MHAtrKANDAN 
OANMAMAMDHINH 
WAMHARMWODOH 


AMO NAAM 
WOMOM AIA 
Ammar oOONH 
Aranmmotnr 
COnntHoNnw 


HOM RODAND 
AOnMamMantnNS 
DHOMHANNKnD 
HAT OnN MOAN 
AWraAnMoNMDIdH 


NANOnNMSd+H 4019 
RAOnannndtoo 
BRMMABDDAWO 
Aonmoadtnonho 
AMONOnNMAN 


ANMHANDNO 
MHOONMDHOWN 
MM HADOAAN 
CHATHAMDDN 
ARHHDONANS 


RMOAWDOWH MOO 
AAMMKHAMDO 
DOMNDrRAWWN 
OMWDHOMADN 
WOW A 


MANNA tOo 
CAnAHDNrEACSD 
DMAMINHAOW 
Aton nannnon 
MOnrHONOH 


NHK HOMONAA 
Wr DOH A CO 
HTRMONTANAD 
MMAeAtTMOADO 
CONtNM AOD 


ONAMNAONH 
ANaAtTHNHOMM 
DOotrnwmarnnan 
ADMIONADHN 
HAN NMAMOAA 


Ahr AnMntaow 
INMOAROnINA tH 
OnRNCAHANHH 
wmAMhHHRAOCOH 
MOOMWKHAODHN 


ChRASANMHANAMDH 
WABASH OH 
MH MDH ONS 
OHAMONHANW 
Rot MDOAAN 


WMA ONOnN at 
RADROHMAH 
WMOrOMDAHAAH 
ADOAN MH HD 
AOtronnnd 


ONRONDWHM NH 
HAR CHMOD 
HAMKR ADO 
Dotn won 
HH AOMannrne 


OMAAAH Em 10 
MAHTMADOMON 
BRHANNDOH OD 
ACKrMNONDHdSH 
CAtor nN on 


HHA OAMNAN HD 
COMnHOAMDH 
WACHDHAN AH 
RoHoOAamMmotna 
WOODHAM HAA 


MArMHONDH 
OnMAHDAMAMH 
MAA DMOAHTE 
AROMA MN HAH 
NAM OM AHO 11H 


AtMH Mor OCHO 
AMMrEARONHK 
CAMDHOMAN 
OCAMHHANDMDO 
WOAARNN~ OO 


AROMOMHAA 
AtanwonHnnMnon 
RHADAMMOWMWD 
HONnNHMNwO SC 
PAODNHOnNAHNH 


WAAMOAr OH 
CHMNWAH 19 dH 
BOR AtNoOM 1D 
MAANHMHANMADO ND 
rPAOMHANHO dH 


DrRawdtnannow9 
MHtHAWDNCOY 
AMIN AON NO 
HRAMAMAMOO 
OrDanaHiDA 


rOAnNAANnHt OH 
HMA DOMrHAD 
ADOnHAHOW 
DAMMMOARDHON 
MatrmHONoO 


MWe HADMOARAA 
DHOMronmndia 
DAM AMON AHO 
DHANMOMHNAM 
MAnMAHOWA 


OMAMAMANO 
OnwtAinwoonms. 
AtmMnDoHMOAn 
rFANDAH HOM 
ROndtarnom o 


AwKr~twmodona 
Ornnnodtn 
Mint reOOWAH 
WAHASCHANM AD 
ArNOWonnn 


15.5 


RANDOM PERMUTATIONS OF 9 AND 16 NUMBERS 


426 


TABLE 15.6 Permurations or 9 (Continued) 


644221839 59659 22718 79895 24254 
1 
4 
5 
9 
8 
6 
7 


OnAHDAMMNNN 
ADOHRMNAOEy 
HOMARANNHD 


RaMmtH OMAN 
BOAR ANOAH 
OHCHANAWHIW 
MANtrornn 
RODMAN GDINON 


Antnonnrnnd 
OHAONnMMNATANM 
RHDANHTOWM 
Annnotonhy 
DAWrnarotre 


ANOhnDAN AHN 
DAMOnMAHAOD 
RHADHOMNANA 
AWM OAH 
HONOKHHM|a” 


DAOMAHMDEHAD 
HAOSCMMOMOHWH 
HR ATHANDNOD 
HOME HAOAM 
HH MDANMMASO 


MHAQHK AO OO 
CMrAMDADAH 
HORnRHAOMAW 
DHAMMDAABDOM 
ROHHAANDIND 


Aoenmnrananowndst 
RAM ADAH 
OCMWROKnAMDHWH 
AWK GCADHAOD 
RAANtHNDS 


MOAHINOhMAN 
DBaANArHoOONA 
MOrMONRH HO 
OCANMMMHDMDAGD 
BANDMOHHOW 


DOAN ANAHH 
DOAnronnaaia 
CHAMANHNG 
MANDA SRWHH OO 
SOrNnHANHMSO 


MOOHHINANN 
BANOMNMrAOH 
DHAAMDDAh O 
AON NDNMAAhy dH 
MORPH NOANY 


MAHAOWM OO 
DHAANMNMOOY 
DAnNonmntna 
HH OAOnNNINM 
AONDMHHEH IDO 


OMOorniawnd 
OOMAANr~wnmod 
HN KM AAMDADO 
RAANTOM HH 
aOnconanmnndt 


Onrantano 
HANmMaononrw 
IMHAMHMAOON 
RNMHOARHON 
MOnRAWAHWH 
BHOAMNANCOHD 
MmAtKr DOAN 
DBNTHRHOMNKhW 
HOMDNAAMDONE 
ROnRAHMAND 


HONMnNatons 
Cantona nor 
CHMDHMHAWNA 
anmoradtwonn 
HM DMONAHSO 


MHtANnM ACH 
ROatwamNo4N 
MN ANHODOND 
AROTOSOMWONNIWH 
HAMHHHASNSO 


HONnonnnod 
BHANnNnonno 
MHMOANnNoNo 
+R ODNDADNHA 
MONNHAMDOMD 


OnHMATAONH 
AwWrOADH HO 
mr ONAN HO OO 
ODMHNDAHNOD 
ME DAAANOHH 


OnMDARMH AINA 
AnOoOMAHONM OD 
CMHNM ADIN 
COMM r~ Won 
DAMHAHNAONN 


HOHAAMNDO 
HODANMDEMANO 
OHArmaAnon 
DAMMAHAADON 
AMHrm OM HON 


MMIHHDROND 
CMNMDME HAH 
Mint DNHANAHOD 
HONMMANAG 
MANnDn HAMS 


ROGOTMMAN 
ADOnNTAMN | 
RNADAMTOHAAG 
ANAND MDH OWA 
MONAORNOY Or 


CHOAMARWNH 
ADAM MOT |DIO 
OO Prt Om AW +H 00 
MWOTHANDON 
MAHMOnNTHNDD 


MArnMDNOHAT 
rOMNNhHAnnd 
AWAHtHNOn MH 
DWHMMAnMAO 
MHANDOnMWMHID 


HOrnnonnm 010 
MAOMOMmHOH 
RADHMARHOND 
ANKHMADR OH 
Ononomo dt or hr 


NOKM HONDO 
CArMPMNAHMAONA 
Aonorndt ano 
rRADOHHMAND 
AornwMAaimnd 


ID MAHDRON 
Or AMMO HAAS 
HOAWMANMHAO 
Orn oOoNnwmin dt 
ANDRMOMNM IND 


Danmorndtarino 
HOMMAATNrO 
HwWOnHONArM AD 
DOoOHAnnnana 
HONONMAMANA 


INMMmHABRHAHOW 
Onmnrmnwaoo dt 
HDNHOMAMAN 
AneOonWoon 
OnMOnRAHOW 
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SO OmMHDARHIwH 


IND HOM ANOMW 
HANOMOMNAY 
AONDMNM HID 
ROMARDAHON 
Aw tim OOD 
mH OANMAONSO 
HMARBAONOM 1H 
HOW MErAHHD 


4243192111 46935 36 
34686 41866 81279 15 


alr eal 
633 
84558 61915 56947 63547 51 


16817 29253 89479 74722 94 


23 
59 


HORANNMOH 
RoAMAOoTOAH 
SDOHOEAMTO 
DOMHONWMHN 
CANMHHADKR|A 


Anotnwmnorn 
MANtHr-AOM+H 


MH trONMH 
OOHOEAMHW 
MATH MADDON 
ONMAMDAHTE 
HAMAM MONO 
MWA MOHAM OD 
Anocmntnna 
HOMmMAACON 
OnnowWANMO dH 
AtONDHAINDD 
HRHONAMNOAD 
RHMAONT OID 
RAMINDODAAH 
AnNMOHNR IO dH 
DrMHNHOAD 
Wr AMASOHAOSD 
AnHONrDRDWIN|D 4H 
DHA MrHOOADH 
Anwowtronnoa 
OCiHHARAOnMOAY 
mAh AHM ADO 
MOAANADM Ot 
AnH HOMDOnN 
HOOHOnNAM 1 
Dor HOANDAN 
CANrATANM 
OnMMDAHON TH 
AQnbrFnNDtOoNnmn 


WoONrnaAse + 
RNA SOMO HASH 
DH OMAAnD 
MODARANATEW 
RNDHAMAOHN 


CHHAWTOAON 
Noondtinoar 
MHtoOm ANH A 
ARMDHNHMN HO 
DNARONWHWON 


ASCANMOr HH 
Aroantnan 


"MINAHHOnNDD 


AMOHAErINDA 
Orn eatianon 


OOM AADMHION 
DAHtaronmnmnna 
HMADHONWME 
DtHAronmnmana 
HOOK MOAWMY 


rHHANHOMDAD 
RHADNHOAO 
AANDNMNAHON 
Omni NOM & OO dH 
AQMmonMmoand 


HHAMODNONA 
ANCHO rAMHA 
CAMrFADMN 
Or Matnarnor 
AMDHONmMAH ID 


~ ON dH 194 Oo 0 
DONMNMANHS 
RANT HMAM DMO 
ADHAMANOMHN 
Qtr nNnnnwos 


HOMANANHS 
NonmnwmtonnrgD 
ANrMmoonno dH 
AAMaAtTrOMO 
PME ADAMDH OS 


TOM DNAKRMO 
WOME HONMDO 
DOAMAMMOAN 
AM MIND HON 
WACOM HDMANH 


Wr DOMAARIH 
atHMOHONAND 
ONDHOWE OD 
WAH rHOMWNAMDN 
IDM OOHMDAN H 


Oorntaonman 
BRHMAOANNAO 
DOMHANMDHEHA 
MHONKrEAMOF 
OMWenAMHOnN 
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8 16 8 12 16 164 9 16 4 2214 5 14 4 7 3138) Gee 
8 14 11 8 9 1444. (2018) J 8 415 16 7 6 15 13 13 13 
12° (03) Geman 10 12 13 14 5 11 10 10 12 9 10 5 16 6 8B 
11 gl eae 13. AD 4 2: 7 36, WoenBe 8: 1 12 6 25 at 
1. 16; 59° aoe 7 18 15 5) 4 3 7 16 8 12 15; 72) (5) a0 
13 8 {1 2 a8 e459 °F) 16 515 4 6 4 110 6 1 14 
7 12 10 10 5 16 5 8 16 2 1228 6 13 14 18 18: 3) 8° 87 
10 15 15 4 14 1 16 16 12 WW O idea <2 10 il 18) % 46 8 
16:7) oes 4 710 6 10 Re Sen Cae) 16°) 6d, 8092 
eee eG 3 13 14 15 15 6 12 .9 15 15 9 16 15 15 10 
8 6 16 5 15 ite 1a ede oS 10 8 3 14 13 9 1 10. 18793 
2) 10) oe; ae ae 9 8 6 ll 12 15 9 7 5 2 8140 a 4S, 
Ba oS eatbae 2 15 11 10 14 fe ae ee a 8 Ith 10)ee2 
4 \1- 499) 32°46 6 1 8 4 16 Bei ta 7 19, iBT ae 
6 138 14 6 12 165 9 Poss 8 4 13 12 10 5 5 4 9 12 16 
16. "eo 8g: aeycio 11 10 5 9 13 142) 16. 39.78. 49 14 11 15 
14 2 18 16 6 26 7 8 9 26; 18. 5) 16 4 8 8 14 6 
1, 91d aye eB Ge) a7: 11 11 9 13 13 7 410 16 9 
9: 78: 10:ae es 5 5 13 15 9 1413 14 9 9 4 8 415 2 
13° 6 15 10 11 8 16°18; 4) 15 B 4. 8 6 4 5 12 14 14 38 
8.066. eer oun: 9 nS? U8) #8) 23) dita) aS, eS" It 2 9 16 10 12 
11 12 9 14 16 1 -4.-08. 1 <4 815 6 15 7 1016718 iy, 
10: 2g 96u aL 1312 9 8 6 7 Sigs Ay: WR 8:18 9 8410 
16: 1 teste. 07. 459) BE 7 2 6 10 16 10 6 4 58. 40: ot 78. 
es ee 114 2 10 18 16. ey 46 34 46. 6 15 12 11 16 
8: 6) ai Ma) te 8 11 16 14 16 iO cis 18.72 1652 i 92 "16 
7 10 ge Suude 6 2 4 13 14 15 610 11 8 1210 3 8 8 
8.00 (8 (8045 FE 083 SF 4°38 1% ig 44 15 44 <2 6 44 
16; “8; seine age 2410) £9 £8030 10; 20a ae 8: 3 7 18 18 14 
6 33) F2"%5 Bp 7 13 10 16 15 12 16 1 14 16 14°58) ot TORT 
12 16 16 4 10 1 7 19 8 2 5 12 16 “12 Chas Cer ae 
442 om Deke 122 16 ll 6 11 Sad eyes: UT 10 1 5 12 6 
14 14 6 li 12 161 4 MW 12 13 9 11 12 10 1305 8 5 i 
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TABLE 15.7 Permutations or 16 (Continued) 
16 2 16 13 18 20 4 4 3 12. TTL tls Gr 4 13 
7) 10) (8) (2 19 5 14 15 5 16 oy ea aC eH | B05 
916 3 8 4 Sunrise ik emp coche gt Sage 
8 1 4 ll Ww aut wayp kOu A. 1% 64 5 16 9 9 16 
3.15 ll 14 8 10 10 10 12 10 yet Real 11 12 
19° 748 15 12 Esty Ewch aa 10 12 10 7 14 15 14 
15-18 9.8, 8 14 6 16 2 13 8 14 15 14 12 Tae Kt 
10g) Sickel varie 61 2 416) 15 3 16 3 6 16 6 8 
Gawiles. 56a 08 eal bk py aa) TVs SB HO ahd 16 10 
11 8 16 16 5 i318 56 45 2 15 (5112 8 “8 10 11 
13 9 10 1 15 169 «#9 38 14 6 6 1 10 13 13° 9 
VEST RS atl 13 3 4) 4 6 410 4 4 15 3 15 
Bite) 7) e210 1198) 28) aha 4 aS maiel6 E10 S28 
Pe Ree ake 416 5 10 5 14 13 6 12 5 12 4 
44 12 7 16 4 13 13 12 WD s@N ies ied 14 6 
178) abi, OL 2 OF F2o 1.50 Oe 9 7 16 13 8 vy 
184 8 11 16 8 7 16 (8-16 1 10 1 15 16 10 11 
aeons 16) 16 11 uu 4 8 3. Be She8 6 vie 
2 2715 5) 37 716) 75) 3B N7 1 8 11 ll 7 16 12 
3 8 12 10 8 6 4 4 1 10 Ol ae eMy MTom t 3 15 
615 4 6 38 10 2 12 15 6 6169 4 8 12. 8 
5 6 10 2 13 15> "8. °7. 16" 3: 2 12 10 9 12 4 9 
Pig heh ter pe haps ede 12) 4 16°25) 16 6 4 
9 8511 is) (2 8 12 9 14 18 13; RON (Seo 18 sh 
SAS” et {2 56. /18° 78421 14 5 3 16 4 13 10 
10 2 14 14 15 Ce Sg eos 815 24 9 ea pt 
SI seas en 9138 5 6 4 11 13 16 12 10 Saris 
16,8518 (9 25 114 6 10 1 169 6 10 14 5 6 
11 10 16 12 12 12,10 1 12 14 7 1412 8 38 4 5 
155 9 16 9 i3°9 1 9 5 4 4% 18-2) 5 8 14 
on? 07" 68) 10 2 510 13) 15 Bomeniie ot. te 2 16 
8 4 18 4 11 A ith Beats 23 1SAdy Wes iat 9 13 
ips SO) <8) +S, 16 4 10 6 10 12 13 16 6 12 10 5 
S8U10 12° 8 7 TV Ses eT UT Chest 1stal2 xy 12 16 
Tombs, ke at 12 15 12 14 4 10 14 11 12 4 15 13 
LOGEA 9) OBIS By tehnoy 4 742) Cael audlOy 10 6 1 
7 6 15 14 15 Aber a7 504 06 216 10 8 il 144 
TEL TI 8) 09 Tiga Ee 16 158. $6) 4a 15 13) 
1978. 4) 2. 8 3 21) 48543: 23 DO oLLe De 43 36 2 15 
14469 7 16 14 110 16 16 6 8 15 3 16 14 16 14 
5 12 138 4 4 1414 4 5 7 165 M4 i 8 Bae 
4 13 14 13 6 6 Sid 84 39 1310 7 7 9 ec} 
8 14 16 12 12 te gue a CE) 4 12 12 5 16 9 11 
16 3°10 11 9 138 5 12 16 Be o) B= A oo 13° 8 
6 92 9 4b Ss BOWS, Ste 1 6 9 13 5 hee] 
16 8 6 6 16 eh esa ae 3 4 15 15 18 (a 
Gite 7s 10. 5 6 6 15 14 PSS Sb 8 10 
2168 At °2 10 5 15 3 3 fa! §8) 2) et 2 49 
(et ae ae ee) 146116 UW 5 7 15 11 10 14 10 
Teoh non 18 15 5) 12° 13). 9 Be 6 Siig 65 ig 
Sats SChaes 7 11 -18)10 5 6 13 6 5 18 9 15 
1. PETG, Wed 6 12 4 2 10 1664 3 16 2 “8 9 
Na AB 4 215 8 12 14469 5 10 15 5 13 
8 11 10 10 15 12 8 11 16 13 10 10 10 3 4 12 01 
B” A= Gy Gab 163 2 8 4 ETL Vd Wa) 8) 7 
Par iee i te 6 de Gr at Af herr aatstiy 4 14 
16055) te 8 15 5 13 8 15 16 8 9 9 11 16 
13 10 6 12 6 11 10 6 15 15 STATS 18: 18 1 5 
10 +3 14 4 B04 P13) it 3 14 14 6 12 3.8 
12 15 3 14 8 SS 614 F1Ar6 9 15 11 4 6 te 
3 16 9 15 2 P43) 9% 16 1ST Asya 16 3 
Ag 4 12°10 10°13 9 6 7 122 8 16 8 11 16 2 
15 13 13 16 14 216: 18. 7 8 45) 4.44 13 12 
Tee) 19. 67° 7 18747, 2004" 2 11 12 12 12 16 6 4 
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TABLE 15.7 Permurarions or 16 (Continued) 


1 10 14 16 12 144 3 10 16 15 a6) 55% 427 56) 29| B RAde! Gee: 
10 16 15 3 4 AU M6UNG) 23, 76 115 10 2 4 3.12 11 12 7 
AGS ie ee DO ae See) eS) 4 44% de 8 40 16) (7 4: 39 
14 3 18 #9 I 8 13 16 8 9 6 13 6 15 14 5 15 4 3 14 
4 12 16 15 5 2 12 7 10 5 BY a bt La Bi NE BEND RE 
2 2 9 56 9 1b; 40 2) Tait 9 8 12 1 16 16 16) §E 19)815, 
in 36> SB ET WO 13 6 8 15 7 Toews: UB, 12 0 18 40 
12 610 6 6 pe ssenhyn aut 2 9 8 12 15 di 914 Sie 100 es 
15 4 8 18 13 107 Pb 0) 99s 26 5 sb ea ete adc) Bb a6) ' 
46 da! 2) at 2 9 4 13:13 4 165 PL 4s et ids a a S88! 
Biorih cy Me AS) we SEER ROW eI: Bi 208" Lb 43> A0) 16°. 13% VL 
9 13) 4.12 3 6 8 1 14 14 13 6 9 10 10 7 +6) 10; (8 is 
TBE Goce 25 12 9 6 il 10 & 16: (ft 9) I 4 8 2 4 12 
ib oh sae 2) 6 4 412 2 7 10¥ 13) 38) 15) 6 el) est 16) 36) 
6: 9? 1d) Or ok 7 10 4 7 18 10 415 4 6 15 13 16 Il 5 
8 5 12 8 16 3 15 12 2 12 5 3 14 16 2 9 (2°38) 18) 1 
‘Bi A SS) 36v 146: 14716) "12: 86. 3; BSis E033: 8. 88) 1610), ae 
a De ON ae Le: 13 8 8 6 14 3 4 16 14 14 % “6 10) .0' a6) 
15 8 16 10 12 6 15 6 3 10 1314412 7 6 415 3 14 6 
13) SR a8) oF 9. C10. BE 58 16h 27 0 78) 10 9S 2 BAe 
WATE ds Ole wei ta) 4 10 ll 15 13 144 16 10 11 5 5 16 13 6 15 
A <8) EL) Gs 6 341 9 1 LY Ga 6816.) 2: Gierd 10) Te! 
3 16: 4 48, <8 1212 5 1 6 613 5 9 4 Lt £974. 86 ad 
6. OE as OL 1S Tort V7 H18> 4S Dai 49) 1S? oS 4 10 16 16 10 
Ag One UB! B2> SO 8 ll 2 14 16 oe te BS! OB: 13 12 15 11 18 
12 Ths *) By “2 i Sete) ky #5; a BO: be 7 St 10: 914 8 12 9 
2 14 18 12 2 2) 44s 55: 9) dO 121563 a 1 FS: eS: 3 t 
eas Ne SS a me Se 12° 11 “6: 36° *9 rele Ese, 
13 15 12 5 38 PE Ga) AG 7 SSID 2 ED it 1665 «12:18 «2 
1112 4 #9 10 Bs Wie a 9 6 38 4 16 2) 12) Ag 
1 10 15 14 18 10 6 16 10 12 15 10 18 12 2 160 24) 1d. a 58) 
LOO 83h 9 58) Te 166214 8 11 6 3 ll 16 12 12) 01 A 15) 48 
Saree! uno vs 15; Ors: ie ok Sepsde Hh Eh eS 1 tO) 78 3: 
10 1 18 16 13 16 11 6 4 IL 4 114 6 15 9756.55: 1G). a1 
aN i a 8 6 16 10 2 5 3 2 13 8 14 12 57 14 «6 
1 13 10 14 8 hin Oe Pie Onn S: iO; 82: 32) 42°97 3 8 10 10 14 
Oh S26 35) 312 2 4 9 16867 10 3 10 Af TG) Ta 9) 
13) '6) 16) ads 4 9 7 16 4 9143841 4 4 112 4 15 
56 12 1 10 10 12 10 3 1 10 11 10 16 15 13 10) 2 G8: Fo 
9 14 #8 13 1 115 13 11 14 14 12 13 -9 12 14063 2 9 10 
16 4 14 9 9 02 6D SO) ke! eG: 1 8 4 4 14 7 14 15 7 2 
ERs cle (de Vs yd mu 1 14 15 18 215 5 10 8 8 68 18 18 16 
Aad. 38: 68.8 7 "7- 10) %8: 72 1 4 8 li 16 io"c15, 80) al s 
fpr hte OSy amet 10 4 4 8 16 70: 33:1) 16" G2 5 13 16 8 6 
14 10 12 1 it Ce we ied Merc Te Woo st Br Ga 1 97 8: Ye 
816 9 5 15 iO) FS c0y 2 7s: E276" 19! 28s: 210 14 15 38 
12 38 4 15 16 133 «11 8 15 13°16 15 14 11 PGE 36} FG: 29) 
Bs Ae OS 214 12 13 5 3.99? #6 18 6. i 46 tl 2 4 
HT Oe td [atalino Haag CP} A Died) F418) 2 2. 18. 016: BL 
10 14 38 13 4 5 13 9 10 8 6 10 14 16 11 U8) Re Als Ve. 29) 
4 15 14 11 12 10 11 8 15 12 911 9 6 4 14 10 14 11 3 
16 12 10 4 14 Hi Seta, "7 33 wi) 8) 32) Fee ee ae 8) EL 08) 
2 9 16°16 16 cat Oe Dy ante ALO). 3 2 15 10 2 Sil AS: 10: e0 
7 10: (s (8 36, 14 15 10 12 13 BS: *S 18. jer ae ko) Sih FAN Ss: 
9 8 15 3 13 faa Sites Thee’ 10 6 4 12 3 145-5 18> 13; 
61 nts 10 Get 8 3 16 14 7 1612 16 7 5 6 6 12 15 5 
io) “6edt o7 *9 81446 211 13 14 5 13 12 M12 8 12) 
564 12 «15 ~ 56 Tas 5. 15) GE, 68) 55.30) BA i Pa 
ie a ae a a 9 10 144 11 9 2) iy 42) 44" 16 10 16 16 8 10 
fy Gale E208 eh es bb ‘4. 0 Beshe: Vt (CDRS Oy Gis Sh ot 45 TA Ie 
Len esr (2) ie 36: 1605 13 3 15 8) 467 13) oh) 29. Bi RAs 2: EB: 46) 
Bi16 =O) #2: -10) 32) 70) Dee e Tae FAS ME 19 ea! 21362 8 
C3 6 8 8 1312 7 4 14 14 13 10 2 16 15 6 10 5 16 
8 13 5 4 8 1 8 5 5 16 1 3 13 15 15 63 4-4 iB 
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RANDOM PERMUTATIONS OF 9 AND 16 NUMBERS 15.5 
TABLE 15.7 Permutations or 16 (Continued) 


4d 8 4 8 4 9 279) 5eai8) eS 15 10 15 13 38 
et? 14 16 144 9 6 84 4 8 4 6 6 10 5 16 
8 16 15 2 10 8 16 7 §2 122 Ge &S 13 2 13 15 10 

10 9 16 9 15 12 14 12 10 13 10 1 12S! SBP 5. 
wo, 118 AL AS 411 3 11 10 8 8 9.12 4 
1 oat 3 8 1 16 10 91547 4a 9 4 8 15 
5 13 TAO) oT HGe 12 15 4 11 16 6 mt ST 4a |G, 
414 6 14 11 13 2 6 11614 9 S6F pp cd 84) 31. 

1 2 OP ioe ae ES) 1! 10 16 2 3 18 B0DR@ 2 21 12 

12 12 215 12 4 7 137 15 14 4 4 14 10 7 
3.8 dt 12756" &% 3s. LeGS) Ste a Ez Seb 12) 16) “1 
2 6 12 64 2 4 8 18 9 12 5 Bec le 25) 68: 

11 10 10 7 8°15 15 1G). S26) BL 16 M115 il 11 14 

a Ti? 5 $e Wee PS: FAS 85 91E 0) FS: 2 12 18 3 2 

16064 ADL F205) 1 11 12 8 2 16 Wee nD. 9 AS 
6 15 1 5 13 10 8 5 5 10 5 12 Bd6? 662) 69, 
6 15 8 10 15 13 9 8 9 12 13 10 3p 46" 18. EO) 8 
4 4 41416 6 5 144811 4 2 SO5nG Ti 16) 16 
Le 16° FC ES Fa ek 6 3 16 12 14 Lpiz, 16 1 3 
3.5 1416 8 11 16 Ty Wa y4) ol 6 7°15 12 14 1 

1 3 82) 58. 58 a6. 9 12 10 7 13 BOX BE 18:59 
9 4 et C Seay (leas io aes 16 56: 46. 3589. iene) AO; 1 18; 

16 12 & OF 18 03) AL 163/302 *S! 38: FF Pople 1s. 27 12 

12 16 10 11 11 12 14 7 5 2 10 16 eB dds 85. EO: 43: 

14 2 Gs N60 ea 211 15 16 «8 410 2 10 4 

10 8 6 8 10 16 12 8) Acton ahd, eo Cen: “6. 93; 15: 

Lue JEL 1 3 3 8 4 Sele ST ok Ta 56 18 15 12 16 
5 10 BUD ark Bt yr 12 15 14 9 12 At BR AT BAS 
we 4u 1 15 14 15 10 10 RAS 268d 13 4 10 13 7 
8 9 139 1 2 15 4207 33 3B) AL eer ey eer L 

13 18 12 12 12 14 13 DR ally 42h 83, 4 14 16 14 
22 yr Si 2 PB 27° $8 18.7 Tp (9) 26. 21, 16) 1% 8. (4:10 

12 13 Lo eek 758 9. 56) 15) 15, «2 LOE PI 1G: 2 
Wee Pe Toe GB) 410 6 3 10 16 67 10 12 12 

1607 1115 5 4 15 812 1 2 15 3 16 16 6 7 

WwW 5 12 11 10 14 «14 1 16 10 13 9 1 5 8 15 14 
Los 15) P5010 325 jy Cres. ee 16.14 LOP ee t= Sl l6, 
2 6 1668 .7 18 12 16 37 39 10. 5: 12202 4 14 5 
5 14 7 14 12 «2 16 16 1 18 7 12 9) 9) IS: BOL NS 

13° 10 & 8 7H 2B. 8 10 2 2 14 18 8 4 6 13 13 

4 9 144 16 6 1 10 Le  eitin da Wo CAEN pre hi ht 
3 11 1B Eee 08s ud) abe 98> 38) At dh) 9) 20-15 
4 15 LE 8810253 7 14 14 11 16 214 15 4 il 

10 12 10 10 16 6 4 Sealy yh) ko. 3: TESTS. Coe Ss 

15 16 G9) sd AL 13) 99) eG; 26 1 3 5 3 10 
6 4 8, 82) 16, (8s. 7 1313 12 6 14 if 15; 14 8-9 
8 2 3 6 13 16 6 5 15 16 4 8 pends Aly io) 26. 
Uae 9 13 14 15 18 2S, £8: 16) 07! 6 12 12 2 4 
3.3 5 11 12 16 12 14 12 6 15 10 Ou Migie eB 8) 10 

Il 12 11 3 13 3 8 2.10. 2 2 45 15: WSL = 5 12, 

1 16 9 7 4 14 13 815 9 3 9 5 14 10 13 8 
1 10 16 2 6 16 9 6 Daas a 2s sb) Se wit) 110: 6 

13° 11 BUTE i Fy aay gaa § 12 5 12 10 14 NCS Ds ee Eee 
8 13 3 6 10 10 5 16616 8 4 7 a Vee Lan tie ee: § 

4 15 Ley 16 695d) BasSh yt OT AN Lhe! Si 57. 1s 

10 6 Bie Sh a4 4 13 10 9 2 6 16 4 7 
7.8 2 116 6 6 13 11 7 16 6 Sele, pa: vel i 
Cue! 8 13 3 1 10 C8 Ae: AS: 11 12 6 9 i 
4 7 IOVS: Li 1s" 68: 11-14 38 12.3 16613 5 16 4 

12 2 Golds Wea tai 14 2 14 1 Wl AB. 3D 

16069 410 14 4 14 6 8 16 6 16 ies 1S 6 sh): 
55 45 dt ST TS Ort Al 26a 55) 6 9 14 12 15 
2 14 2 38902 16 10 4 15 li 13 9 15 13 15 13 
6 4 1316 5 5 16 5 9 138 14 4 1363 4 14 4 
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TABLE 15.7 Permutations or 16 (Continued) 


18 {3 1% 8 9 Songts 2b) 9s: ee 15, 16igtie eo, ak Baton ety if) 7B 
Boh Sy ny 3) 4. 19 18) 4 413 3 4 5 6 10 16 1 8 
4 6 19 4 42 al Gee 7 2 “9° 16 15 i i 6 5 
B) dd eed as 10 3 14 7 16 115 4 12 9 Susi, 15-2 
10 8 1 10 16 14 10 10 16 2 Glaabt Sees © tueakt 4 16 9 13 12 
2 16 4 7 14 13.16 11 14 7 1663 14 7 6 inn 13) @% 16 
9 (9 18) 2) af 152 4 4 4a 3 12 13 16 16 Boe th 2 Uy 
14020) 070 Sea. fy x 20 a 5 4 10 138 7 yemsoy or 8: 10 
813 66hsi8 16, Sime Se eB 13 9 5 l4 4 5 15 10 12 15 
7 14 16 15 13 12.56 eset 8 515 38 8 net io) 8: 17. 
21 2° 8 #@ 5 1313 4 38 12; Ge Bie "8. Le 13 2 2 10 14 
Ti ae taney ts ame) Vib Suk Obl ipa ues 16 5 ll 16 6 
1) Mois? 52 13) 6: 28) 45 19) 9 10 7 16 5 10 fois te 9) 13 
6 12 15 14 4 4148 5 6 iis ie it i 1943 3 5 9 
13 1 10 16 2 7 8 16 15 15 14 14 11 10 12 Pe ee 
16 15 12 6 5 115 12 2 10 Serio, 1g) 19) 18 secu 16 At a 
7 th 13! Tou 35, 1373. 46: 16 12 12 10 2 14 isis) 10: 9 2 
9 14 16 14 16 1018 4 6 1 7 15 8 15 10 4 16 12 13 14 
OR Comey Bes PY Gutsnlre uy ght 9) Yb) 11 76. 32 9 8 115 4 
13 8 15 16 10 OG: 59. ab a eS ie? 8 8 15 18 10 8 
Be A hae 10 15 6 5 3 18 Bk aida ora 2 5 6 14 10 
1018 16: 18; 32 5 8 3 18° {7 16 10 15 8 3 ds <2. 7 3B 
16 8 9 13 18 4 id 2 at 2o 8 an 16 11 14 16 13 
Cah 1 i 14 Toy is it 18 2 6 4 16 9 een il 79) 19 
BsyO! eae goo Ae 8 11 15 4 10 15 14 13 11 6 (oh Chat aN Ra 
165 15 14 i 8 1 14 10 16 15 13; 59° \9) 3 W645 11 6 
By BS Pe Be 6. 6 16 11 10 8 i 8) WB as) eis 12 9 
2° 20) Aes an) 4 13 10 16 8 16 10 3 12 12 15 7 13 8 8 16 
410 12 6 1 8 8 4 12 12 W 37) 14. 0 13 4 16 1 15 
4 12 8 15 6 1) $9 66) 7 a de BC, (6. (5. 18 410 4 5 1 
1). 72) On 10) 8 415 7 15 2 5 16 5 1 16 6. 9% 18 16 
Bi GecER elxas 9 484 4 iat) 9738) if, UL Ee esel 
9 138 10 7 11 oi vat i 1611 15 9 2 12 4 10 10 1 
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sugar beet, 122 
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Type IV, ¢ = b, 322 
3 X 3 balanced lattice, 274 
completely randomized block design, 
87 
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quasi-latin square 
2° factorial in 4 X 4 square, 242 
randomized block design, 95 
3 X 4 rectangular lattice, 295 
series of experiments, 392 
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split-plot design, 224 
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and place variations, 412 
unweighted analysis, 404 
Experimental errors, 15 
Experimental material 
arrangement, 281, 302, 317, 348 
selection, 31 
Experimental unit, 15 
grouping of experimental units, 36 


F, variance ratio, 47, 51 
Factorial, 29 
Factorial experiments 
advantages of factorial 
factors independent, 124 
factors not independent, 125 
designs, 146 
examples 
4X 4 &X 3 factorial, 147 
2? factorial, 122 
4 X 2 factorial, 132 
3? factorial, 185, 189, 141 
general method of analysis, 138 
interpretation of analysis 
3 X 8 factorial, 139, 141 
main effects and interactions 
2” factorial series, notation, 127 
4 X 2 factorial, 132 
3° factorial, 135 
split-plot designs, 218 
Fractional replication, 189 
2° factorial, half replicate, 191 
2° factorial, blocks of 16 units, 192 
2° factorial, blocks of 8 units, 192 
2? factorial, blocks of 16 units, 193 
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plans, 120 
randomization, 118 
statistical analysis, 119 
use in design of 3? factorial, 135 


Half-plaid squares, 248 
statistical analysis and plans 
2 X (2°) factorial in a 4 X 4 square, 
250, 256 
2X (3 X 2) factorial in a 6X6 
square, 250, 256 
3X (3 X 2) factorial in a 6X6 
square, 250, 257 


Half-plaid squares, statistical analysis 
and plans, 2 X (2°) factorial in an 
8 X 8 square, 251, 257 
2X (24) factorial in an 8X8 
square, 251, 258 


Incomplete block designs 
balanced incomplete block designs, 259 
balanced incomplete blocks, 260, 315 
balanced lattices, 260, 274 
incomplete latin square, 261, 370 
lattice square, 260, 346 
choice of incomplete block design, 
270 
arranged in lattice squares and in- 
complete latin squares, 272 
arranged in randomized incomplete 
blocks, 271 
comparison with 
random controls, 269 
randomized block designs, 267 
split-plot designs, 269 
systematic controls, 269 
partially balanced incomplete block 
designs, 261 
partially balanced lattices, 261, 280 
partially balanced lattice squares, 
347 
statistical analysis 
without recovery of inter-block in- 
formation, 262 
with recovery of inter-block infor- 
mation, 264 
Incomplete latin squares, 261, 370 
missing data, 374 
plans, r < t, 379 
plans, r > t, 388 
randomization, 371 
small numbers of treatments, 375 
Type II, k = it — 1, 377 
Type IV, k = i +1, 378 
statistical analysis, 371 
Type I, with recovery of inter-block 
information, 372 
Type II, without recovery of inter- 
block information, 372 
Type Ia, repetition of Type I, 373 
Type Ia, repetition of Type IL 
374 
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Incomplete latin squares, summary of 
plans 
r <t, 370 
r >t, 376 
Inference, statistical, 3 
Interactions, 124, 127 
Interpretation, 1 


Latin squares, 103 
efficiency, 112 
missing values, 110 
plans, 119 
randomization, 106 
replications, number of, 105 
standard errors, 109 
statistical analysis, 106 
Lattice designs, 274 
balanced lattices, 260, 274 
missing data, 277 
randomization, 282 
statistical analysis, 274 
index to plans, 271 
in series of experiments, 405 
partially balanced lattices, 261, 280 
arrangement of experimental mate- 
rial, 281 
four and higher numbers of repli- 
cates, 281 
missing data, 290 
randomization, 282 
repetitions of design, 286 
simple lattices, 280 
statistical analysis, 282 
triple lattices, 280 
plans, 304 
rectangular lattices, 292 
plans, 311 
repetitions of design, 298 
statistical analysis, 294 
Lattice squares, 346 
arrangement of experimental material, 
348 
balanced lattice squares, 260, 346 
index to plans, 272 
in series of experiments, 405 
missing data, 357 
partially balanced squares, 347 
plans, 360 
randomization, 348 
statistical analysis, 348 
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Lattice squares, ( ) or fewer replica- 


2 
tions, 348 
(k + 1) replications, 352 
2(k + 1) replications, 357 
Limits of error, 23 


Main effects, 123 
Mathematical model, 39 
analysis of variance, 47 
covariance, 75 
incomplete block designs 
intra-block information, 262 
intra- and inter-block information, 
264 
missing data, 72 
repetitions in space and time, 411 
series of experiments, combined analy- 
sis, 395 
split-plot design, 219 
treatment effects, estimation of, 42 
Missing data, 72 
balanced incomplete blocks, 325 
balanced lattice, 277 
confounded factorials, 202 
incomplete latin squares, 374 
latin square, 110 
lattice squares, 357 
mathematical model, 72 
partially balanced lattices, 290 
randomized blocks, 98 
split-plot design, 227 


Number of figures, 54 


Objectives of experiment, 10 
Orthogonal, 57 
components, 397 


Permutations, random, 414 
construction, 414 
tables 
permutations of 9, 422 
permutations of 16, 428 
use, 414 
Plaid square designs 
statistical analysis and plans 
2 X 2 X (2°) factorial in an 8 X 8 
square, 251, 258 
2X 2 X (24) factorial in an 8 X 8 
square, 251, 258 


Planning of experiments, 9 
experimental material, 31 
objectives, 10 
replications, number of, 17 
selection of treatments, 11, 29 

Plans 
balanced incomplete blocks, 329 

index to plans, 271, 327 
completely randomized designs, 88 
confounded factorials 

2” series, 157, 194, 208 

index to plans, 186 
3” series, 164, 196, 212 
index to plans, 186 
mixed series, 174, 199, 214 
index to plans, 186, 188 
cubic lattice designs, 300 
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Quasi-latin squares, statistical analysis 
and plans 
2° factorial in a 4 X 4 square, 242, 
252 
2+ factorial in an 8 X 8 square, 244, 
252 
2° factorial in an 8 X 8 square, 244, 
253 
2° factorial in an 8 X 8 square, 245, 
253 
3° factorial in a 9 X 9 square, 245, 
254 
3* factorial in a 9 X 9 square, 245, 
255 
4 X 2? factorial in an 8 X 8 square, 
246, 256 
treatments applied to complete rows, 
248, 250 


factorial experiments, 146 
fractional replication, 189 
graeco-latin squares, 117, 120 
half-plaid squares, 256 
incomplete latin squares, 272, 379 
index, r < t, 370 
index, r > t, 376 
latin square designs, 103, 109 
lattice designs, 271 
balanced lattices, 304 
partially balanced designs, 304, 309 
lattice square designs, 360 
index of plans, 272 
plaid square designs, 258 
quasi-latin squares, 252 
rectangular lattice designs, 311 
split-plot design 
BCD confounded, 238 
for y X 2° factorial, 239 
simple, 218 
sub-unit treatments confounded, 237 
sub-unit treatments in latin square, 
235 
sub-unit treatments in strips, 232 
systematic arrangement, 231 
Pooled error mean square 
series of experiments, 406 
Precision, 16 


Quasi-latin squares, 241 
efficiency, 247 
randomization, 241 


treatments applied to complete rows 
and columns, 251 


Randomization 
function of randomization, 6 
more than 16 numbers, 416 
restricted, 8 
tables of random numbers 
9 numbers, 422 
16 numbers, 428 
tests of randomness, 416 
Randomized blocks, 94 
advantages, 94 
efficiency, 100 
missing data, 98 
randomization, 95 
standard error, 98 
statistical analysis, 95 
Rectangular lattices, 292 
see also Lattice designs 
Regression 
components of treatments, 1382 
quadratic component, 409 
Replications, 1 
for prescribed limits of error, 23 
table, 24 
for tests of significance, 17 
tables, 20, 21 
fractional replication experiments, 189 
in latin squares, 105 
in lattice designs, 282 
single replication experiments, 188 
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Series of experiments Standard error formulae, lattices, cubic, 


all experiments, same design, 394 
analysis, initial steps in, 391 
experiments, unequal size, 401 
heterogeneity of experimental error 
variances, 399 
heterogeneity of interaction variance, 
396 
lattice experiments, series of, 405 
mathematical model, 395 
numerical example, 392 
orthogonal components, 397 
repetitions in space and time, 411 
summary, combined analysis, 401 
treatments X places interactions test, 
407 
Simple effects, 122 
Split-plot designs, 218 
confounded designs with plots split, 
236 
repeated subdivision, 229 
standard errors, 230 
simple split-plot design, 218 
efficiency, 221 
example, numerical, 224 
experimental error, 219 
mathematical model, 219 
missing data, 227 
randomization, 222 
standard error, 223 
statistical analysis, 222 
sub-unit treatments 
confounded, 237 
in latin squares, ‘234 
in strips, 231 
systematic arrangement, 230 
Standard error formulae 
balanced incomplete blocks, 318 
completely randomized blocks, 87 
cross-over, 115 
factorials, 138 
graeco-latin squares, 118 
half-plaid squares, 250 
incomplete block, 262, 264 
incomplete latin squares, 371 
latin squares, 106 
lattice squares, 348 
lattices 
balanced, 274 


302 
partially balanced, 282 
rectangular, 294 
quasi-latin squares, 242 
randomized blocks, 95 
split plot, 222 
Standard errors, 17 
pooled error, 91 
rules for calculating, 64 
when variances heterogeneous, 91 
Statistical analysis, 39 
example, numerical, 49 
general method of analysis, 39 
analysis of variance, 47 
assumptions, 41 
effects of errors in assumptions, 83 
confidence limits, 44 
construction, 46 
estimation of treatment effects, 42 
mathematical model, 39 
tests of significance, 44 
Subdivision 
of error s.s., 69 
of treatments s.s., 55 
Sum of products, 76 


t-distribution 
approximate t’, 92 
errors different, 91 
Technique, 30 
Test of significance, 4, 44 
effects suggested by data, 67 
number of replications for, 20 
Treatments, 11 
comparison among means, 64, 71 
components, 56 
effects, estimating, 42 
linear functions, 56 
means, adjusted, 32 
selection of, 11, 29 
sum of squares 
incomplete subdivision, 59 
sample sets, 58 
single components, 56 
subdivision, 55 


Youden squares, see Incomplete latin 
squares 
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